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(57) Abstract 

Hydrophobicatly-modified proteins and methods of making them are described. A hydrophobic moiety is attached to a surface amino 
acid residue of the protein. The hydrophobic moiety can be a lipid or a peptide. Alternatively, the protein can be derivatized by a wide 
variety of chemical reactions that append a hydropiiobic structure to the piotein. The preferred protein is of mammalian origin and is 
selected from the group consisting of Sonic. Indian, and Desert hedgehog. The hydrophobic moiety is used as a convenient tether to which 
may be attached a vesicle such as a cell membrane, liposome, or micelle. 
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Hydrophobically-modified Protein Compositions and Methods 



Background of the Invention 

It is known that certain proteins exhibit greater biological activity when attached 
5 to other moieties, either by formation of multimeric complexes, where the proteins have 
an opportunity to act in concert, or through other alterations in the protein's physico- 
chemical properties, such as the protein's absorption, biodistribution and half life. Thus, 
one ciurent area of research in biotechnology involves the development of methods to 
modify the physico-chemical properties of proteins so that they can be administered in 
1 0 smaller amounts, with fewer side effects, by new routes, and with less expense. 

For example, the bindmg afiSnity of any single protein (such as a ligand for its 
cognate receptor) may be low. However, cells normally express hundreds to thousands 
of copies of a particular surface receptor, and many receptor-ligand interactions take 
place simultaneously. When many surface molecules become involved in binding, the 

15 total effective affinity is greater than the sum of the binding aflRnities of the individual 
molecules. By contrast, when ligand proteins are removed from the cell surface and 
purified, or isolated by recombinant DNA techniques for use, e.g., as then^utics, they 
act as monomers and lose the advantage of acting in concert with many other copies of 
the same protein associated closely on the surface of a cell. Thus isolated, the low 

20 affinity of a protein for its receptor may become a serious drawback to its effectiveness 
as a ther^utic to block a particular binding pathway, since it must compete against the 
high avidity cell-cell interactions. Effective treatment might require constant 
administration and/or high doses. Such drawbacks might be avoided, however, if a 
means could be found to provide multimeric forms of an isolated protein. 

25 Similarly, it would be useful to modify other physico-chemical properties of 

biologically active proteins so that, for instance, a protein is induced to associate with a 
membrane thus localizing it at the site of administration and enhancing its ability to 
bind to, or otherwise mteract with, a particular target. Such changes may also affect the 
pharmaco-distribution of the protein. 

30 Several methods of generating coupled proteins have been developed. Many of 

these methods are not highly specific, i.e., they do not direct the point of coupling to 
any particular site on the protein. As a result, conventional coupling agents may attack 
functional sites or sterically block active sites, rendering the coupled proteins inactive. 
Furthermore, the coupled products may be oriented so that the active sites carmot act 

35 synergistically, thereby rendering the products no more effective than the monomeric 
protein alone. 
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As an additional motivation to find new methods for protein modification, 
proteins with an N-terminal cysteine residue are susceptible to oxidation or other 
chemical modifications that may compromise activity or half-life. Additionally, certain 
buffers commonly used in protein purification have components or impurities that can 
5 modify the N-terminal cysteine. Even when these buffers are avoided, the N-terminal 
cysteine is modified over time, perhaps due to chemicals in the storage vials or in the 
air. Consequently, formulation buffers must include a protective agent, such as 
dithiothreitol, to prevent cysteine modification and/or oxidation. However, protective 
agents have significant biological activity of their own and they may therefore 
1 0 complicate experiments and adversely affect the therapeutic utility of a formulation. 

Accordingly, there is a need in the art to develop more specific means to obtain 
derivatized products or multimeric forms thereof so as to alter the properties of the 
protein in order to affect its stability, potency, pharmacokinetics, and 
pharmacodynamics, 

15 

Summary of the Invention 

In one aspect of the invention, we have solved the problem of fmding a way to 
convenientiy make modified forms of biologically active proteins. Methods of the 
invention can be used to derive multimeric forms of the proteins and/or can be used to 

20 change their physico-chemical properties. We have found that modifying a protein 
(i.e, adding or appending a hydrophobic moiety to an existing amino acid or 
substituting a hydrophobic moiety for an amino acid) so as to introduce the 
hydrophobic moiety onto a protein can increase the protein's biological activity and/or 
its stability. For example, an N-terminal cysteine can be used as a convenient ''target'* 

25 to attach a hydrophobic moiety (e.g., a lipid) and thereby modify biologically active 
proteins. 

Alternatively, a hydrophobic moiety can be attached to a C-terminal residue of a 
biologically active protein, such as hedgehog protein, to modify the protein's activity. 
A hydrophobic moiety can also be appended to an internal amino acid residue to 
30 enhance the protein's activity, provided the modification does not affect the activity of 
the protein, e.g., the proteins ability to bind to a receptor or co-receptor, or affect the 
protein's 3-dimensional structure. Preferably, the hydrophobic moiety is appended to 
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an internal amino acid residue that is on the surface of the protein when the protein is in 
its native form. The hydrophobic modification of the invention provides a generically 
useful method of creating proteins with altered physico-chemical properties as 
compared to non-modified forms. 

5 This invention originated from the discovery that when we expressed fiill-length 

Sonic hedgehog protein in insect and in mammalian cells, the mature form of the 
protein (residues 1-1 74 in the mature sequence), in addition to having cholesterol at the 
C-terminus, is also derivatized at its N-terminal end with a fatty acid. Significantly, 
this form of hedgehog exhibited about a 30-fold increase in potency as compared to 
1 0 soluble, unmodified hedgehog in an in vitro assay. 

One aspect of the invention is therefore an isolated, protein comprising an N- 
terminal amino acid and a C-terminal amino acid, wherein the protein is selected from 
the group consisting of a protein with an N-terminal cysteine that is appended with at 
least one hydrophobic moiety; a protein with an N-terminal amino acid that is not a 
1 5 cysteine appended with a hydrophobic moiety; and a protein with a hydrophobic moiety 
substituted for the N-terminal amino acid. The hydrophobic moiety can be a 
hydrophobic peptide or any lipid or any other chemical moiety that is hydrophobic. 

The protein may be modified at its N-terminal amino acid and preferably the N- 
terminal amino acid is a cysteme or a functional derivative thereof. The protein may be 

20 modifed at its C-terminal amino acid or at both the N-terminal amino acid and the C- 
terminal amino acid, or at at least one amino acid internal to (i.e., intermediate between) 
the N-terminal and C-terminal amino acids, or various combinations of these 
configurations. The protein can be an extracellular signaling protein and in preferred 
embodiments, the protein is a hedgehog protein obtainable from a vertebrate source, 

25 most preferably obtainable from a human and includes Sonic, Indian, and Desert 
hedgehog. ^ 

Another embodiment is an isolated, protein of the form: A-Cys-[Sp]-B- X, 
wherein 

A is a hydrophobic moiety; 
30 Cys is a cysteine or functional equivalent thereof; 

[Sp] is an optional spacer peptide sequence; 
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B is a protein comprising a plurality of amino acids, including at least 
one optional spacer peptide sequence; and 

X is optionally another hydrophobic moiety linked to the protein. 

The isolated protein can be an extracellular signaling protein, preferably a 
5 hedgehog protein. This protein can be modified at at least one other amino acid with at 
least one hydrophobic moiety. In other embodiments, the protein is in contact with a 
vesicle in selected from the group consisting of a cell membrane, micelle and liposome. 

Another aspect of the hivention is an isolated, protein having a C-terminal 
amino acid and an N-terminal thiaproline group, the thiaproline group formed by 

1 0 reacting an aldehyde with an N-terminal cysteine of the protein. A further aspect of the 
invention is isolated, protein havmg a C-tcrminal amino acid and an N-terminal amide 
group, the amide group formed by reacting a fatty acid thioester with an N-terminal 
cysteine of the protein. Yet another aspect of the invention is an isolated, protein 
having a C-terminal amino acid and an N-terminal maleimide group, the N-terminal 

1 5 maleimide group formed by reacting a maleimide group with the N-terminal cysteine of 
the protein. . Yet another aspect of the invention is an isolated, protein having a C- 
terminal amino acid and an N-terminal acetamide group. A further aspect of the 
invention is an isolated, protein having a C-terminal amino acid and an N-terminal 
thiomorpholine group. 

20 In these embodiments, the C-terminal amino acid of the protein can be modified 

with an hydrophobic moiety. The isolated protein can be an extracellular signaling 
protein, most preferably a hedgehog protein. 

Methods of the invention include a method of generating a multivalent protein 
complex comprising the step of linking, in the presence of a vesicle, a hydrophobic 
25 moiety to an N-terminal cysteine of a protein, or a functional equivalent of the N- 
terminal cysteine. The linking step may include linking a lipid moiety which is 
selected from saturated and unsaturated fatty acids having between 2 and 24 carbon 
atoms. The protein can be an extracellular signaling protein, preferably a hedgehog 
protein selected from the group consisting of Sonic, Indian and Desert hedgehog. 

30 Yet another method of the invention is a method for modifying a physico- 

chemical property of a protein, comprising introducing at least one hydrophobic moiety 
to an N-terminal cysteine of the protein or to a functional equivalent of the N-terminal 
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cysteine. The hydrophobic moiety can be a lipid moiety selected from saturated and 
unsaturated fatty acids having between 2 and 24 carbon atoms. It can also be a 
hydrophobic protein The protein modified using this method can be an extracellular 
signaling protein, preferably a hedgehog protein selected from the group consisting of 
5 Sonic, Indian and Desert hedgehog. A protein complex, produced by these methods are 
also encompassed by the present invention. 

Other extracellular signaling proteins besides hedgehog include gelsolin; an 
interferon, an interleukin, tumor necrosis factor, monocyte colony stimulating factor, 
granulocyte colony stimulating factor, granulocyte macrophage colony stimulating 
1 0 factor, erythropoietin, platelet derived growth factor, growth hormone and insulin. 

Another method is a method for modifying a protein (such as an extracellular 
signaling protein) that has an N-terminal cysteine. This method comprises reacting the 
N-terminal cysteine with a fatty acid thioester to form an amide, wherein such 
modification enhances the protein's biological activity. 

15 Yet another method is a method for modifying a protein (such as an 

extracellular signaling protein) having an N-temiinal cysteine, which comprising 
reacting the N-terminal cysteine with a maleimide group, wherein such modification 
enhances the protein's biological activity. Other embodiments of this method involve 
reacting the N-terminal cysteine with either an aldehyde group, an acetamide group or a 

20 thiomorpholine group. 

A further method is a method for modifying protein (such as an extracellular 
signaling protein) comprising appending an hydrophobic peptide to the protein. The 
hydrophobic moiety can be appended to an amino acid of the protein selected from the 
group consisting of the N-terminal amino acid, the C-terminal amino acid, an amino 

25 acid intermediate between the N-tenninal amino acid and the C-terminal amino acid, 
and combinations of the foregoing. In one embodunent, the present invention provides 
hedgehog polypeptides which are modified with lipophilic moieties. In certain 
embodiments, the hedgehog proteins of the present invention are modified by a 
lipophilic moiety or moieties at one or more intenal sites of the mature, processed 

30 extracellular domain, and may or may not be also derivatized with lipophilic moieties at 
the N or C-terminal residues of the mature polypeptide. In other embodiments, the 
polypeptide is modified at the C-terminal residue with a hydrophobic moiety other than 
a sterol. In still other embodiments, the polypeptide is modified at the N-terminal 
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residue with a cyclic (preferably polycyclic) lipophilic group. Various combinations of 
the above are also contemplated.A therapeutic method of the invention is a method for 
treating a neurological disorder in a patient comprising adinmistering to the patient a 
hydrophobically-modified protein of the invention. 

5 

Brief Description of the Figures 

Fig. 1. Characterization of a palmitovlated form of Shh . A tethered form of 
human Shh was inunxmoaflSnity purified from High Five™ insect cells and analyzed by 
SDS-PAGE. The protein was stained with Coomassie blue (lane a, Life Technologies, 

10 Inc. prestained high molecular weight markers; lane b, soluble Shh (0,6 ^g); lane c, 
tethered Shh (0.6 ^g); lane d, mixture of soluble plus tethered Shh (0.6 ^g each)). The 
ability of Shh and Ihh (see lane h) to be modified with palmitic acid was ass^ed using 
a cell-free system described in Example 2. Soluble forms of hedgehog protem (3 
|ig/sample) were incubated for 1 h with rat liver microsomes, ATP, CoenzymeA, and 

15 ^H-palmitic acid, and then analyzed for palmitoylation by SDS-PAGE. The samples 
shown in lanes e-i were visualized by fluorography (lane e, Shh; lane f, des 1-10 Shh; 
lane g, Cys-1 to Ser Shh; lane h, Ihh; lane i, His-tagged Shh) and in lanes j-k by 
Coomassie staining (lane j, Shh; lane k des 1-10 Shh). 

Fig. 2. Analysis of purified Shh by ESI-MS . Soluble human Shh (A) and 
20 tethered human Shh (B) were analyzed by ESI-MS on a Micromass Quattro II triple 
quadrupole mass spectrometer, equipped with an electrospray ion source. All 
electrospray mass spectral data were acquired and stored in profile mode and were 
processed using the Micromass MassLynx data system. Molecular mass spectra are 
shown (mass assignments were generated by the data system). 

25 Fig. 3. Analysis of tethered Shh by reverse phase HPLC. Soluble human Shh 

(A), tethered human Shh from High Five™ insect cells (B), tethered human Shh from 
EBNA-293 cells (C), and cell-associated rat Shh (D) were subjected to reverse phase 
HPLC on a narrow bore Vydac C4 column (2.1 mm internal diameter x 250 mm). The 
column was developed with a 30 min, 0-80% acetonitrile gradient in 0.1% 

30 trifluoroacetic acid at 0.25 mL/min and the effluent monitored using a photodiode array 
detector from ?00-300 ran (data shown at 214 nm). Peak fractions were collected and 
characterized further by SDS-PAGE and MS (data summarized in Tables 3, 4, and 5). 
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Fig. 4. Characterization of Shh by LC-MS . Tethered human Shh (A) and 
soluble human Shh (B) were alkylated with 4-vinylpyridine (1 ^L/100 \iL sample in 6 
M guanidine HCl, I mM EDTA, 100 mM Tris HCl pH 8.0), ethanol precipitated, and 
digested with endoproteinase Lys-C in 50 mM Tris HCl pH 7.0, 2 M urea at an enzyme 

5 : protein ratio of 1 : 5 as described previously (27). The digests were analyzed by 
reverse phase HPLC in line with an electrospray Micromass Quattro II triple 
quadrupole mass spectrometer. Scans were acquired throughout the run and processed 
using the Micromass MassLynx data system (total ion chromatograms from the runs 
are shown). Asterisks indicate the positions of the N-terminal peptide which were 

1 0 verified either by MALDI PSD or N-temiinal Edman sequencing. 

Fig. 5. Sequencing of the N-terminal Shh peptide bv MALDI PSD 
measurement. The N-terminal endoproteinase Lys-C peptide from tethered human Shh 
was subjected to MALDI PSD measurement on a Voyager-DE™ STR time of flight 
mass spectrometer. The predicted fragmentation pattern and nomenclature for the 

15 detected fragment ions are shown at the top of the panel (PA, palmitoyi acid; 4vp, 4- 
pyridylethyl group). The remainder of the Figure shows the molecular mass spectrum 
produced by the run. Relevant ions are denoted using the nomenclature defined m the 
schematic. Calculated masses (Da) for b,- bg are 447.3, 504.3, 601.4, 658.4, 814.5, 
871.5, 1018.6, and 1075.6, respectively. For y,- yg, the masses (Da) are 147.1, 204.1, 

20 351.2, 408.2, 564.3, 621.3, 718,4, and 775.4, respectively. The calculated mass for z, is 
758.4 Da. The observed mass for bg contains an additional 18 Da due to an added 
water. 

Fig. 6. Increased activity of tethered Shh in the C3H10T1/2 assav. The 
relative potencies of soluble and tethered human Shh alone (A) or m the presence of the 

25 anti-hedgehog neutralizing Mab 5E1 (B) were assessed on C3H10T1/2 cells measuring 
alkaline phosphatase induction. The numbers presented reflect the averages of 
duplicate determinations. (A) Serial 2-fold dilutions of soluble (6) and tethered (8) 
Shh were mcubated with the cells for 5 days and the resulting levels of alkaline 
phosphatase activity measured at 405 nm using the alkaline phosphatase chromogenic 

30 substrate ;>-nitrophenyl phosphate. (B) Serial dilutions of Mab 5E1 were incubated 
with soluble Shh (5 |ig/mL: black bars) or tethered Shh (0.25 ^g/mL: gray bars) or 
vehicle control without Shh added (white bar) for 30 rain and then subjected to analysis 
in the C3HT101/2 assay. 
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Fig. 7. Analysis of Shh in a receptor binding assay. The relative potency of 
soluble (6) and tethered (8) Shh for binding to patched was assessed on patched- 
transfected EBNA-293 cells by FACS analysis. Serial dilutions of the test samples 
were incubated with the EBNA-293 cells, washed, and then the percent binding 
5 measured by the ability of the samples to compete with Shh-Ig for binding to the cells. 
Bound Shh-Ig was quantified by mean fluorescence using a FITC-labeled anti-Ig 
antibody probe as the readout. The data were fitted to a hyperbolic curve by non-linear 
regression. 

Fig. 8. Alignment of N-terminal fragment of human hedgehog proteins . The 20 
10 kDa human hedgehog proteins (Sonic *'Shh", Desert "Dhh" and Indian "Ihh") are 
aligned with respect to their N-terminal cysteine (Cys-1 in the mature sequence). This 
cysteine is normally Cys-24 in the full- length Shh precursor protein due to the 
presence of the natural signal sequence that is removed during secretion. The actual 
position of the cysteine may vary slightly due to species differences. 

15 Fig 9. Consensus Sequence of the N-terminal fragment of human hedgehog 

proteins> 

Fig. 10, Effect of lipid chain length on activity of human Sonic hedgehog. A 
series of fatty acid-modified hedgehog proteins was synthesized according to the 
present invention and the effect of the fatty acid chain length on hedgehog activity was 
20 tested using the C3H10T1/2 alkaline phosphatase induction assay described herein. 
The results are plotted as a bar graph. 

Fig. II. C3H10T1/2 assay of palmitovlated, rnvristvolated, laurovlated. 
decanovlated, and octanovlated human Sonic hedgehog . Palmitoylated, lauroylated, 
decanoylated, and octanoylated human Sonic hedgehog formulated in 5 mM Na2HP04 

25 pH 5,5, 150 mM NaCl, 1% octylglucoside, 0.5 mM DTT, and myristoylated human 
Sonic hedgehog, formulated in 150 mM NaCl, 0.5 mM DTT, were assayed on 
C3H10T1/2 cells measuring alkaline phosphatase induction. The numbers represent the 
mean of duplicate determinations. Serial 3-fold dilutions of palmitoylated (o), 
myristoylated (•), lauroylated (□), decanoylated (■), octanoylated (A), and 

30 unmodified (▲ and x) human Sonic hedgehog were incubated with the cells for 5 days 
and the resulting levels of alkaline phosphatase measured at 405 nm using the 
chromogenic substrate p-nitrophenyl phosphate. The palmitoylated, myristoylated, 
lauroylated, and decanoylated proteins were assayed in one experiment with the 
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xmmodified protein shown as (A), while the octanoylated protein was assayed in 
another experiment with the unmodified protein shown as (x). The arrow on the y-axis 
denotes the background level of alkaline phosphatase in the absence of added hedgehog 
protein. 

5 Fig. 12. Generic structures of various hvdroDhobicaHv-modified forms of 

hedgehog. (A) Fatty amide derivative where R = a hydrocarbon chain of a fatty acid; 
(B) thiazolidine derivative where R = a hydrocarbon; (C) amino acid substitution 
where R - a hydrophobic amino acid side chain; (D) maleimide derivative where R = a 
hydrocarbon; (E) SH = free thiol on N-terminal cysteine of wild type hedgehog; (F) an 
10 iodoacetamide derivative where R, = a hydrocarbon and Rj = either H or a 
hydrocarbon; and (G) thiomorpholinyl derivative where R = a hydrocarbon. For ail 
structures, HH = hedgehog. 

Fig. 13. Relative potencv of various hvdrophobicallv-modified forms of 
hedgehog in the C3H10T1/2 assav. The EC50 (2 jig/ml) of unmodified wild type human 
15 Sonic hedgehog is designated as 1 x. The potency of the other proteins is expressed as 
the ratio of the EC50 of wild type protein divided by the EC^o of the modified protein. 
Modifications are at the N-terminus of the protein unless designated otherwise. 

Fig. 14. Relative potencv of the unmodified, mvristovlated, and CI II mutant of 
human Sonic hedgehog in a malonate-induced rat striatal lesion assav . The figure 
20 shows the reduction in malonate-induced lesion volume which results from the 
administation of either unmodified, myristoylated, or the CI II mutant of human Sonic 
hedegehog to ti^e rat striatum. 

Fig. 15. illustrates the specific activities of maleimide modified and unmodified 
hedgehog polypeptides. 

25 

Detailed Description of the Invention 

This invention is based, in part, on the discovery that human Sonic hedgehog, 
expressed as a full-length construct in either insect or in mammalian cells, has a 
hydrophobic palmitoyl group appended to the a-amme of the N-terminal cysteine. This 
30 is the first example, of which the inventors are aware, of an extracellular signaling 
protein being modified in such a manner, and, in contrast to thiol-linked palmitic acid 
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modifications whose attachment is readily reversible, this novel N-linked palmitoyl 
moiety is likely to be veiy stable by analogy with myristic acid modification. 

As a direct consequence of this initial discovery, the inventors have found that 
increasing the hydrophobic nature of a signaling protein can increase the protein's 
5 biological activity. In particular, the inventors have found that appending a 
hydrophobic moiety to a signaling protein, such as a hedgehog protein, can enhance the 
protein's activity. The inventors have found that the N-terminal cysteine of 
biologically active proteins not only provides a convenient site for appending a 
hydrophobic moeity, and thereby modifying the physico^hemical properties of the 
10 protein, but modifications to the N-terminal cysteine can also increase the protein's 
stability. Additionally, addition of a hydrophobic moiety to an internal amino acid 
residue on the surface of the protein structure enhances the protein's activity. We use 
as an example, our discovery of hydrophobic (e.g., lipids and hydrophobic amino acid) 
modifications of hedgehog protein. 

15 One aspect of the present application is directed to the discovery that, in 

addition to those effects seen by cholesterol-addition to the C-terminus of extracellular 
fragments of the protein, at least certain of the biological activities of the hodutho,' 
gene products are unexpectedly potentiated by derivativation of the protein with 
lipophilic moieties at other sites on the protein and/or by moieties other than 

20 cholesterol Certain aspects of the invention are directed to preparations of hedgehog 
polypeptides which are modified at sites other than N-terminal or C-terminal residues 
of the natural processed form of the protein, and/or which are modified at such tenninal 
residues with lipophilic moieties other than a sterol at the C-terminus or fatty acid at the 
N-temiinus. 

25 As described in PCT publications WO 95/18856 and WO 96/17924 (all of 

which are expressly incorporated by reference herein), hedgehog polypeptides in 
general are useful in the in vitro and in vivo repairing and/or regulating the functional 
performance of a wide range of cells, tissues and organs, and have therapeutic uses 
ragning firom neuroprotection, neuroregeneration, enhancement of neural function, 

30 regulation of bone and cartilage formation and repair, regulation of spermatogenesis, 
regulation of lung, liver and other organs arising icom the primative gut, regulation of 
hematopoietic function, etc. Accordingly, the methods and compositions of the present 
invention include the use of the derivatized hedgehog polypeptides for all such uses as 
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hedgehog proteins have been implicated. Moreover, the subject methods can be 
performed on^cells which are provided in culture (in vitro), or on cells in a whole 
animal {in vivo). 

In one aspect, the present invention provides pharmaceutical preparations 
5 comprising, as an active ingredient, a hais^L h' v polypeptide being derivatized by one or 
more lipophiUc moieties such as described herein. 

The subject //c'l/Vr/^ v treatments are effective on both human and animal 
subjects. Animal subjects to which the invention is applicable extend to both domestic 
animals and livestock, raised either as pets or for commercial purposes. Examples are 
1 0 dogs, cats, cattle, horses, sheep, hogs and goats. 

The hedgehog proteins are a family of extracellular signaling proteins that 
regulate various aspects of embryonic development both in vertebrates and in 
invertebrates (for reviews see 1,2). The most well-characterized hedgehog protem is 
Sonic hedgehog (Shh), involved in anterior-posterior patterning, formation of an apical 

15 ectodermal ridge, hmdgut mesoderm, spinal column, distal limb, rib development, and 
lung development, and in inducing ventral cell types in the spinal cord, hindbrain and 
forebrain (3-8). While the mechanism of action of hedgehog proteins is not understood 
fully, the most^ recent biochemical and genetic data suggest that the receptor for Shh is 
the product of the tumor suppressor gene, patched (9,10) and that other protems; 

20 smoothened (10,11), Cubitus interruptus (12,13), and fitsed (14) are involved in the 
hedgehog signaling pathway. 

Human Shh is synthesized as a 45 kDa precursor protein that is cleaved 
autocatalytically to yield: (I) a 20 kDa N-terminal fragment that is responsible for all 
known hedgehog signaling activity (SEQ ID NOS. 1-4); and (II) a 25 kDa C-terminal 
25 fragment that contains the autoprocessing activity (15-17). The N-terminal fragment 
consists of amino acid residues 24-197 of the full-length precursor sequence. 

The N-termmal fragment remains membrane-associated through the addition of 
a cholesterol at its C-terminus (18,19). TTiis cholesterol is critical for restricting the 
tissue localization of the hedgehog signal. The addition of the cholesterol is catalyzed 
30 by the C-terminal domain during the processing step. 

All references cited in the detailed description are, unless otherwise stipulated, 
incorporated herein by reference. 
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I. Definitions 

The invention will now be described with reference to the following detailed 
description of which the following definitions are included: 

5 "amino acid"- a monomeric unit of a peptide, polypeptide, or protein. There are 
twenty amino acids found in naturally occurring peptides, polypeptides and proteins, all 
of which are L-isomers. The term also includes analogs of the amino acids and D- 
isomers of the protein amino acids and their analogs. 

"protein"- any polymer consistmg essentially of any of the 20 amino acids. 
10 Although "polypeptide" is often used in reference lo relatively large polypeptides, and 
"peptide" is often used in reference to small polypeptides, usage of these terms in the 
art overlaps and is varied. The term ^'protein" as used herein refers to peptides, proteins 
and polypeptides, unless otherwise noted. 

"N-terminal end"- refers to the first amino acid (amino acid number 1) of the 
1 5 mature form of a protein. 

"N-terminal cysteine"- refers to the amino acid residue (number 1) as shown in 
SEQ ID NOS. M. It also refers to any cysteine at position 1 of any other protein, or 
fiinctional equivalents of this cysteine (See Section IV). 

"spacer" sequence refers to a short sequence that can be as small as a single amino 
20 acid that may be inserted between an amino acid to be hydrophobically modified (such 
as, for example, the N-terminal cysteine or fimctional equivalent) and the remainder of 
the protein, A spacer is designed to provide separation between the-hydrophobic 
modification (e.g., the modified N-terminal cysteine) and the rest of the protein so as to 
prevent the modification firom interfering with protein fimction and/or make it easier for 
25 the modification (e.g., the N-terminal cysteine) to link with a lipid, vesicle, or other 
hydrophobic moiety. Thus, if a protein is modified at its N-terminal cysteine and at an 
amino acid at another site, there may be two, or more, spacer sequences. 

"tethered" protein- refers to a hydrophobically-raodified protein according to the 
invention. 



"multivalent protein complex"- refers to a plurality of proteins (i.e., one or more). 
A lipid or otlier hydrophobic moiety is attached to at least one of the plurality of 
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proleins. The lipid or other hydrophobic moiety may optionally be in contact with a 
vesicle. If a protein lacks a lipid or other hydrophobic moiety, then that protein may be 
cross-linked or bind to a protein that does have a lipid or other hydrophobic moiety. 
Each protein may be the same or different and each lipid or other hydrophobic moiety 
5 may be the same or different 

"vesicle"- refers to any aggregate of lipophilic molecules. The vesicle may be 
obtained from a biologic source (e.g., a lipid bilayer such as a cell membrane or a 
cholic acid-derived detergent preparation) or from a non-biologic source (e.g., a non- 
biologic detergent vesicle as described in Section VI). The shape, type, and 
1 0 configuration of the vesicle is not intended to limit the scope of this invention. 

"functional equivalent" of an amino acid residue (e.g., an N-terminal cysteine> is (i) 
an amino acid having similar reactive properties as the amino acid residue that was 
replaced by the functional equivalent; (ii) an amino acid of a ligand of a polypeptide of 
the invention, the amino acid having similar hydrophobic (e.g., lipid) moiety binding 
1 5 properties as the amino acid residue that was replaced by the fimctional equivalent; (iii) 
a non-amino acid molecule having similar hydrophobic (e.g., lipid) moiety binding 
properties as the amino acid residue that was replaced by the functional equivalent. 

"genetic fusion"- refers to a co-linear, covalent linkage of two or more proteins or 
fragments thereof via their individual peptide backbones, through genetic expression of 
20 a polynucleotide molecule encoding those proteins. 

A "chimeric protein" or "fusion protein" is a fiision of a first ammo acid 
sequence encoding a hedgehog polypeptide with a second amino acid sequence 
defining a domain foreign to and not substantially homologous with any domain of hh 
protein. A chimeric protein may present a foreign domain which is found (albeit in a 
25 different protein) in an organism which also expresses the first protein, or it may be an 
"interspecies", "intergenic", etc. fusion of protein structures expressed by different 
kinds of organisms. In general, a fusion protein can be represented by the general 
formula Q()n'{hh)j^'(Y)^, wherein hh represents all or a portion of the hedgehog 

protein, X and Y each independently represent an amino acid sequences which are not 
30 naturally found as a polypeptide chain contiguous with the hedgehog sequence, m is an 
integer greater than or equal to 1, and each occurrence of n is, independently, 0 or an 
integer greater than or equal to 1 (n and m are preferably no greater than 5 or 1 0). 



s 
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"mutant" - any change in the genetic material of an organism, in particular any 
change (i.e., deletion, substitution, addition, or alteration) in a wild type polynucleotide 
sequence or any change in a wild type protein. 

"wild type" - the naturally-occurring polynucleotide sequence of an exon of a 
5 protein, or a portion thereof, or protein sequence, or portion thereof, respectively, as it 
normally exists in vivo. 

"standard hybridization conditions"- salt and temperature conditions substantially 
equivalent to 0.5 X SSC to about 5 X SSC and 65°C for both hybridization and wash. 
The term "standard hybridization conditions" as used herein is an operational definition 
10 and encompasses a range of hybridization conditions. See also Current Protocob in 
Molecular Biology, John Wiley & Sons, Inc. New York, Sections 6.3.1-6.3.6, (1989). 

"expression control sequence"- a sequence of polynucleotides that controls and 
regulates expression of genes when operatively linked to those genes. 

"operatively linked"- a polynucleotide sequence (DNA, RNA) is operatively linked 
15 to an expression control sequence when the expression control sequence controls and 
regulates the transcription and translation of that polynucleotide sequence. The term 
"operatively linked" includes having an appropriate start signal (e,g., ATG) in front of 
the polynucleotide sequence to be expressed, and maintaining the correct reading frame 
to permit expression of the polynucleotide sequence under the control of the expression 
20 control sequence, and production of the desired polypeptide encoded by the 
polynucleotide sequence. 

"expression vector"- a polynucleotide, such as a DNA plasmid or phage (among 
other common examples) which allows expression of at least one gene when the 
expression vector is introduced into a host cell. The vector may, or may not, be able to 
25 replicate in a cell. 

"Isolated" (used interchangeably with "substantially pure")- when applied to 
nucleic acid i.e., polynucleotide sequences that encode polypeptides, means an RNA or 
DNA polynucleotide, portion of genomic polynucleotide, cDNA or synthetic 
polynucleotide^ which, by virtue of its origin or manipulation: (i) is not associated with 
30 all of a polynucleotide with which it is associated in nature (e.g., is present in a host cell 
as an expression vector, or a portion thereof); or (ii) is linked to a nucleic acid or other 
chemical moiety other than that to which it is linked in nature; or (iii) does not occur in 
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nature. By "isolated" it is further meant a polynucleotide sequence that is: (i) amplified 
in vitro by, for example, polymerase chain reaction (PGR); (ii) synthesized chemically; 
(iii) produced recombinantly by cloning; or (iv) purified, as by cleavage and gel 
separation. 

5 Thus, "substantially pure nucleic acid" is a nucleic acid which is not immediately 
contiguous with one or both of the coding sequences with \^ch it is normally 
contiguous in the naturally occurring genome of the organism from which the nucleic 
acid is derived. Substantially pure DN A also includes a recombinant DNA which is part 
of a hybrid gene encoding additional hedgehog sequences. 

10 "Isolated" (used interchangeably with "substantially pure")- when applied to 
polypeptides means a polypeptide or a portion thereof which, by virtue of its origin or 
manipulation: (i) is present in a host cell as the expression product of a portion of an 
expression vector; or (ii) is linked to a protein or other chemical moiety other than that 
to which it is linked in nature; or (iii) does not occur in nature, for example, a protein 

15 that is chemically manipulated by appending, or adding at least one hydrophobic 
moiety to the protein so that the protein is in a form not found in nature.. By "isolated" 
it is further meant a protein that is : (i) synthesized chemically; or (ii) expressed in a 
host cell and purified away firom associated and contaminating proteins. The term 
generally means a polypeptide that has been separated fipom other protems and nucleic 

20 acids wth which it naturally occurs. Preferably, the polypeptide is also separated from 
substances such as antibodies or gel matrices (polyacrylamide) which are used to purify 
h. 

i 

"Heterologous promoter"- as used herein is a promoter which is not naturally 
associated with a gene or a purified nucleic acid. 

25 "Homologous"- as used herein is synonymous with the term "identity" and refers to 
the sequence similarity between two polypeptides, molecules, or between two nucleic 
acids. When a position in both of the two compared sequences is occupied by the same 
base or amino acid monomer subunit (for instance, if a position in each of the two DNA 
molecules is occupied by adenine, or a position in each of two polypeptides is occupied 

30 by a lysine), then the respective molecules are homologous at that position. The 
percentage homology between two sequences is a function of the number of matching 
or homologous positions shared by the two sequences divided by the number of 
positions compared x 100. For instance, if 6 of 10 of the positions in two sequences are 



wo 99/28343 PCr/US98/25676 



-16- 

matched or are homologous, then the two sequences are 60% homologous. By way of 
example, the DNA sequences CTGACT and CAGGTT share 50% homology (3 of the 6 
total positions are matched). Generally, a comparison is made when two sequences are 
aligned to give maximum homology. Such alignment can be provided using, for 
5 instance, the method of Needleman et al., J, Mol Biol 48: 443-453 (1970), implemented 
conveniently by computer programs such as the Align program (DNAstar, Inc.). 
Homologous sequences share identical or similar amino acid residues, where similar 
residues are conservative substitutions for, or "allowed point mutations" of, 
corresponding amino acid residues in an aligned reference sequence. In this regard, a 

10 "conservative substitution" of a residue in a reference sequence are those substitutions 
that are physically or functionally similar to the corresponding reference residues, e.g., 
that have a similar size, shape, electric charge, chemical properties, including the ability 
to form covalent or hydrogen bonds, or the like. Particularly preferred conservative 
substitutions are those fulfilling the criteria defined for an "accepted point mutation" in 

15 Dayhoff et al., 5: Atlas of Protein Sequence and Structure, 5: Suppl. 3, chapter 22: 
354-352, Nat Biomed. Res. Foundation, Washington, D.C. (1978). 

A "hedgehog protein" or "hedgehog polypeptide", as the terms are used 
interchangeably, of the invention is defined in terms of having at least a portion that 
consists of the consensus amino acid sequence of SEQ ID NO: 4. The term also means 

20 a hedgehog polypeptide, or a functional variant of a hedgehog polypeptide, or homolog 
of a hedgehog polypeptide, or functional variant, which has biological activity. In 
particular, the terms encompasses preparations of : proteins and peptidyl 

fragments thereof, both agonist and antagonist forms as the specific context will make 
clear.As used herein the term "bioactive fragment of a hedgehog protein" refers to a 

25 fiagment of a full-length hedgehog polypeptide, wherein the firagment specifically 
agonizes or antagonizes inductive events mediated by wild-type hedgehog proteins. 
The hedgehog biactive fragment preferably is a soluble extracellular portion of a 
hedgehog protem, where solubility is with reference to physiologically compatible 
solutions. Exemplary bioactive firagments are described m PCT publications WO 

30 95/18856 and WO 96/17924. In preferred embodiments, the hedgehog polypeptides of 
the present invention bind to the patched ^xoXtin, 
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The term "corresponds to", when referring to a particular polypeptide or nucleic 
acid sequence is meant to indicate that the sequence of interest is identical or 
homologous to the reference sequence to which it is said to correspond. 

The terms "peptide(sy', "protein(sy' and "polypeptide(s)" are used 
5 interchangeably herein. The terms "polynucleotide sequence" and "nucleotide 
sequence" are also used interchangeably herein. The terms "Hedgehog fragment" and 
"Hedgehog N-terminal fragment" are used interchangeably with "Hedgehog". 

A hedgehog molecule has "biological activity" if it has at least one of the following 
properties: (i) the molecule meets the hedgehog consensus criteria as defined herein 

10 (SEQ ID NO: 4) and has the ability to bmd to its receptor, patched or it encodes, upon 
expression, a polypeptide that has this characteristic; (ii) the molecule meets the 
hedgehog consensus criteria as defined herein or it encodes, upon expression, a 
polypeptide that has this characteristic; and (iii) it may induce alkaline phosphatase 
activity in C3H10T1/2 cells. Generally, any protein has "biological activity" if the 

15 protein has in vitro effects, properties, or characteristics that persons having ordinary 
skill in the art would recognize as being representative of, commensurate with, or 
reasonably predictive of, the protein's m vivo effects. 

The term "hydrophobic" refers to the tendency of chemical moieties with nonpolar 
atoms to interact with each other rather than water or other polar atoms. Materials that 

20 are "hydrophobic" are, for the most part, insoluble in water. Natural products with 
hydrophobic properties include lipids, fatty acids, phospholipids, sphingolipids, 
acylglycerols, waxes, sterols, steroids, terpenes, - prostaglandms, thromboxanes, 
leukotrienes, isoprenoids, retenoids, biotin, and hydrophobic amino acids such as 
tryptophan, phenylalanine, isoleucme, leucine, valine, methionine, alanine, proline, and 

25 tyrosine. A chemical moiety is also hydrophobic or has hydrophobic properties if its 
physical properties are determined by the presence of nonpolar atoms. The term 
includes Upophilic groups. 

The term "lipophilic group", in the context of bemg attached to a polypeptide, 
refers to a group having high hydrocarbon content thereby giving the group high 
30 affmity to lipid phases. A lipophilic group can be, for example, a relatively long chain 
alkyl or cycloalkyl (preferably n-alkyl) group having approximately 7 to 30 carbons. 
The alkyl group may terminate with a hydroxy or primary amine "tail". To further 
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illustrate, lipophilic molecules include naturally-occurring and synthetic aromatic and 
non-aromatic moieties such as fatty acids, esters and alcohols^ other lipid molecules, 
cage structures such as adamantane and buckminsterfiillerenes, and aromatic 
hydrocarbons such as benzene, perylene, phenanthrene, anthracene, naphthalene, 
5 pyrene, chrysene, and naphthacene. 

The phrase "internal amino acid" means any amino acid in a peptide sequence that 
is neither the N-terminal ammo acid nor the C-terminal amino acid. 

The phrase "surface amino acid" means any amino acid that is exposed to solvent 
when a protem is folded in its native form. 

1 0 The phrase "extracellular signaling protein" means any protein that is either secreted 
from a cell, or is tethered to die outside of a cell, and upon binding to the receptor for 
that protein on a target cell triggers a response in the target cell. 

An "effective amount" of, e.g., a luduciu',: rnW^p.-jiudj, with respect to the 
subject methods of treatment, refers to an amount of polypeptide in a preparation 
1 5 which, when applied as part of a desired dosage regimen brings about, e.g., a change in 
the rate of cell proliferation and/or the state of differentiation of a cell and/or rate of 
survival of a cell according to clinically acceptable standards for the disorder to be 
treated or the cosmetic purpose. 

A "patient" or "subject" to be treated by the subject method can mean either a 
20 human or non-human animal. 

The "growth state" of a cell refers to the rate of proliferation of the cell and the 
state of differentiation of the cell. 

Practice of the present invention will employ, unless indicated otherwise, 
conventional techniques of cell biology, cell culture, molecular biology, microbiology, 
25 recombinant DNA, protein chemistry, and immunology, which are within the skill of 
the art. Such techniques are described in the literatxire. 

n. General Properties of Isolated Hedgehog Proteins 
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The polypeptide portion of the //t\/-,\ > compositions of the subject method 
can be generated by any of a variety of techniques, including purification of naturally 
occiuring proteins, recombinantly produced proteins and synthetic chemistry. 
Polypeptide forms of the hedgehog therapeutics are preferably derived from vertebrate 
5 hedgehog proteins, e.g., have sequences corresponding to naturally occunring hedgehog 
proteins, or fragments thereof, from vertebrate organisms. However, it will be 
appreciated that the hedgehog polypeptide can conespond to a hedgehog protein (or 
fragment thereof) which occurs in any metazoan organism. 

Isolated hedgehog proteins used in the methods of this invention are naturally 
1 0 occurring or recombinant proteins of the hedgehog family and may be obtainable from 
either invertebrate or from vertebrate sources (see references below). Members of the 
vertebrate hedgehog protein family share homology with proteins encoded by the 
Drosophila hedgehog (hh) gene (33). To date, the combined screening of mouse 
genomic and cDNA libraries has identified three mammalian hh counterparts referred to 
15 as Sonic hedgehog (5/jh), Indian hedgehog {Ihh\ and Desert hedgehog (Dhh\ which 
also exist in other mammals, including humans, as well as in fish and in birds. Other 
members include Moonrat hedgehog {Mhh\ as well as chicken Sonic hh and zebrafish 
Sonic hh. 

Mouse and chicken Shh and mouse Ihh genes encode glycoproteins which 
20 undergo cleavage, yielding an amino terminal fragment of about 20kDa (See Figure 8) 
and a carboxy terminal fragment of about 25kDa. The most preferred 20kDa Augment 
has the consensus sequence SEQ ID NO: 4 and includes the amino acid sequences of 
SEQ ID NOS: 1-3. Various other firagments that encompass the 20kDa moiety are 
considered within the presentiy claimed invention. Publications disclosing these 
25 sequences, as well as their chemical and physical properties, include (34-38); PCT 
Patent Applications WO 95/23223 (Jessell, Dodd, Roelink and Edlund), WO 95/18856 
(Ingham, McMahon and Tabin) and WO 96/17924 (Beachy et al.). 

Family members useful in the mediods of the invention include any of the 
naturally-occurring native hedgehog protems including allelic, phylogenetic 
30 counterparts or other variants thereof, whether naturally-sourced or produced 
chemically including muteins or mutant proteins, as well as recombinant forms and 
new, active members of the hedgehog family. Particularly useful hedgehog 
polypeptides include SEQ ID NOS: 1-4. 
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Isolated hedgehog polypeptides used in the method of the invention have 
biological activity. The polypeptides include an amino acid sequence at least 60%, 
80%, 90%, 95%, 98%, or 99% homologous to an amino acid sequence from SEQ ID 
NOS; 1-4, The polypeptide can also include an amino acid sequence essentially the 
5 same as an amino acid sequence in SEQ ID NOS: 1-4. The polypeptide is at least 5, 1 0, 
20, 50, 100. or 150 amino acids in length and includes at least 5, preferably at least 10, 
more preferably at least 20, most preferably at least 50, 100, or 150 contiguous amino 
acids from SEQ ID NOS: 1-4. 

The preferred polypeptides of the invention include a hedgehog polypeptide 
10 sequence as well as other N-terminal and/or C-temiinal amino acid sequence or it may 
include all or a fragment of a hedgehog amino acid sequence. The isolated hedgehog 
polypeptide can also be a recombinant fusion protein havmg a first hedgehog portion 
and a second polypeptide portion, e.g., a second polypeptide portion having an amino 
acid sequence unrelated to hedgehog. The second polypeptide portion can be, e.g., 
15 histidine tag, maltose binding protein, glutathione-S-transferase, a DNA binding 
domain, or a polymerase activating domain. 

Polypeptides of the invention include those which arise as a result of the 
existence of multiple genes, alternative transcription events, alternative RNA splicing 
events, and alternative translational and posttranslational events. The polypeptide can 
20 be made entirely by synthetic means or can be expressed in systems, e.g., cuhured cells, 
which result in substantially the same posttranslational modifications present when the 
protein is expressed in a native cell, or in systems which result in the omission of 
posttranslational modifications present when expressed in a native cell. 

In a preferred embodiment, isolated hedgehog is a hedgehog polypeptide vrfth 
25 one or more of the following characteristics: 

(i) it has at least 30, 40, 42, 50, 60, 70, 80, 90 or 95% sequence identity 
with amino acids of SEQ ID NOS: 1-4; 

(ii) it has a cysteine or a functional equivalent as the N-terminal end; 

(iii) it may induce alkaline phosphatase activity in C3H10T1/2 cells; 

30 (iv) it has an overall sequence identity of at least 50%, preferably at least 

60%, more preferably at least 70, 80, 90, or 95%, with a polypeptide of SEQ ID NO; 1- 
4 
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(v) it can be isolated from natural sources such as mammalian cells; 

(vi) it can bind or interact with patched; and 

(vii) it is hydrophobically-modified (i.e., it has at least one hydrophobic 
moiety attached to the polypeptide). 

5 

III. Other Proteins 

Since techniques exist for engineering a cysteine residue (or its functional 
equivalent) into a polypeptide's primary sequence, virtually any protein can be 
converted into a hydrophobically-modified form using the methods described herein. 

10 Viral receptors, cell receptors, and cell ligands are useful because they bind 

typically to cells or tissues exhibiting many copies of the receptor. Useful viral-cell 
protein receptors that can be complexed together using the methods of this invention 
include ICAMl, a rhinovirus receptor; CD2, the Epstein-Barr virus receptor; and CD4, 
the receptor for human immunodeficiency virus (HIV). Other proteins include 

15 members of the cell adhesion molecule family, such as ELAM-1 and VCAM-1 and 
VCAM-lb and their lymphocyte counterparts (ligands); the lymphocyte associated 
antigens LFAl, LFA2 (CD2) and LFA3 (CD58), CD59 (a second ligand of CD2), 
members of the CD11/CD18 family and very late antigens such as VLA4 and their 
ligands. 

20 Immunogens from a variety of pathogens (e.g., from bacterial, fungal, viral, and 

other eukaryotic parasites) may also be used as polypeptides in the methods of the 
invention. Bacterial immunogens include, but are not Hmited to, bacterial sources 
responsible for bacterial pneumonia and pneimiocystis pneumonia. Parasitic sources 
include the Plasmodium malaria parasite. Viral sources include poxvirus (e.g, cowpox, 

25 herpes simplex, cytomegalovirus); adenoviruses; papovaviruses (e.g., papillomavirus); 
parvoviruses (e.g., adeno-associated virus); retroviruses (e.g., HTLV I, HTLV 11, HIV I 
and HIV II) and others. Immunoglobulins, or fragments thereof, may also be 
polypeptides that can be modified according to the invention. One can generate 
monoclonal Fab fragments recognizing specific antigens using conventional methods 

30 (49) and use the individual Fab domains as functional moieties in multimeric constructs 
according to this invention. Other useful proteins include, gelsolin (50); cytokines, 
including the various interferons (interferon-a, interferon-p, and interferon-y); the 
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various interleukins (e,g., IL-1 , -2, -3, -4, and the like); the tumor necrosis factors-a 
and -P; monocyte colony stimulating factor (M-CSF), granulocyte colony stimulating 
factor (G-CSF), granulocyte macrophage colony stimulating factor (GM-CSF), 
erythropoietin, platelet-derived growth factor (PDGF), and human and animal 
5 hormones, including growth hormone and insulin. 

Generally, the structure of the modified proteins of this invention has the 
general formula: A-CyS"[Sp]-B-[Sp]-X, where A is a hydrophobic moiety; Cys is a 
cysteine or a functional equivalent thereof; [Sp] is an optional spacer peptide sequence; 
B is a protein (which optionally may have another spacer peptide sequence as shown); 

1 0 and X is a hydrophobic moiety linked (optionally by way of the spacer peptide) to the a 
C-terminal end of the protein or another surface site of the protein, wherein the 
derivatized protein includes at least one of A or X. If X is cholesterol, then B may, or 
may not be, a hedgehog protein. As discussed above, the purpose of the spacer is to 
provide separation between the hydrophobic moiety and the rest of the protein so as to 

15 make it easier for the hydrophobic moiety (e.g., a modified N-terminal cysteine) to link 
with another moiety which may be a lipid or a vesicle. The spacer is also intended to 
make it more difficult for the modification to interfere with protein function. A spacer 
may be as small as a single amino acid in length. Generally, prolines and glycines are 
preferred, A particularly preferred spacer sequence is derived from Sonic hedgehog 

20 and consists of the amino acid sequence: G-P-G-R. 

IV* Production of Recombinant Polypeptides 

The isolated polypeptides described herein can be produced by any suitable method 
known in the art. Such methods range from direct protein synthetic methods to 
25 constructing a DNA sequence encoding isolated polypeptide sequences and expressing 
those sequences in a suitable transformed host 

In one embodiment of a recombinant method, a DNA sequence is constructed 
by isolating or synthesizing a DNA sequence encoding a wild type protein of interest. 
Optionally, the sequence may be mutagenized by site-specific mutagenesis to provide 
30 functional analogs thereof See, e.g., (40) and United States Patent 4,588,585. Another 
method of constructing a DNA sequence encoding a polypeptide of interest would be 
by chemical synthesis using an oligonucleotide synthesizer. Such oligonucleotides may 
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be preferably designed based on the amino acid sequence of the desired polypeptide, 
and preferably selecting those codons that are favored in the host cell in which the 
recombinant polypeptide of interest will be produced. 

Standard methods may be applied to synthesize an isolated polynucleotide 
5 sequence encoding a isolated polypeptide of interest For example, a complete amino 
acid sequence may be used to construct a back-translated gene. See Maniatis et al., 
supra. Further, a DNA oligomer containing a nucleotide sequence coding for the 
particular isolated polypeptide may be synthesized. For example, several small 
oligonucleotides coding for portions of the desired polypeptide may be synthesized and 
10 then Ugated. The individual oligonucleotides typically contain 5* or 3* overhangs for 
complementary assembly. 

Once assembled (by synthesis, site-directed mutagenesis, or by another 
method), the mutant DNA sequences encoding a particular isolated polypeptide of 
interest will be inserted into an expression vector and operatively linked to an 

15 expression control sequence appropriate for expression of the protein in a desired host. 
Proper assembly may be confirmed by nucleotide sequencing, restriction mapping, and 
expression of a biologically active polypeptide in a suitable host As is well known in 
the art, in order to obtain high expression levels of a transfected gene in a host, the gene 
must be operatively linked to transcriptional and translational expression control 

20 sequences that are functional in the chosen expression host. 

The choice of e?q)ression control sequence and expression vector will depend 
upon the choice of host. A wide variety of expression host/vector combinations may be 
employed. Useful expression vectors for eukaiyotic hosts, include, for example, 
vectors comprising expression control sequences from SV40, bovine papilloma virus, 

25 adenovirus and cytomegalovirus. Useful expression vectors for bacterial hosts include 
known bacterial plasmids, such as plasmids from Esherichia coli, including pCRl, 
pBR322, pMB9 and their derivatives, wider host range plasmids, such as Ml 3 and 
filamentous single-stranded DNA phages. Preferred £ coli vectors include pL vectors 
containing tiae lambda phage pL promoter (U.S. Patent 4,874,702), pET vectors 

30 containing the T7 polymerase promoter (Studier et al.. Methods in Enzymology 1 85: 
60-89, 1990 1) and the pSP72 vector (Kaelin et al., supra). Useful expression vectors for 
yeast cells, for example, include the 2 T and centromere plasmids. 
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In addition, any of a wide variety of expression control sequences may be used 
in these vectors. Such xiseful expression control sequences include the expression 
control sequences associated with structural genes of the foregoing expression vectors. 
Examples of useful expression control sequences include, for example, the early and 
5 late promoters of SV40 or adenovirus, the lac system, the trp system, the TAC or TRC 
system, the major operator and promoter regions of phage lambda, for example pL, the 
control regions of fd coat protein, the promoter for 3-phosphoglycerate kinase or other 
glycolytic enzymes, the promoters of acid phosphatase, e.g., Pho5, the promoters of the 
yeast a-mating system and other sequences known to control the expression of genes of 
1 0 prokaryotic or eukaryotic cells and their viruses, and various combinations thereof 

Any suitable host may be used to produce in quantity the isolated hedgehog 
polypeptides described herein, including bacteria, fiingi (including yeasts), plants, 
insects, manunals, or other appropriate animal cells or cell lines, as well as transgenic 
animals or plants. More particularly, these hosts may include well known eukaryotic 
15 and prokaryotic hosts, such as strains of E. coli, Pseudomonas, Bacillus, Streptomyces, 
fungi, yeast (e.g., Hansenula ), insect cells such as Spodoptera frugiperda (SF9), and 
High Five™ (see Example 1), animal cells such as Chinese hamster ovary (CHO), 
mouse cells such as NS/0 cells, African green monkey cells COSl, COS 7, BSC 1, 
BSC 40, and BMT 10, and human cells, as well as plant cells. 

20 It should be understood that not all vectors and expression control sequences 

will function equally well to express a given isolated polypeptide. Neither will all hosts 
function equally well with the same expression system. However, one of sldll in the art 
may make a selection among these vectors, expression control systems and hosts 
without undue experimentation. For example, to produce isolated polypeptide of 

25 interest in large-scale animal culture, the copy number of the expression vector must be 
controlled. Amplifiable vectors are well known in the art. See, for example, (41) and 
U.S. Patents 4,470,461 and 5,122,464. 

Such operative linking of a DNA sequence to an expression control sequence 
includes the provision of a translation start signal in the correct reading frame upstream 
30 of the DNA sequence. If the particular DNA sequence being expressed does not begin 
with a methionine, the start signal will result in an additional amino acid (methionine) 
being located at the N-terminus of the product. If a hydrophobic moiety is to be linked 
to the N-terminal methionyl-containing protein, the protein may be employed directly 
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in the compositions of the invention. Neverthless, since the preferred N-terminal end of 
the protein is to consist of a cysteine (or functional equivalent) the methionine must be 
removed before use. Methods are available in the art to remove such N-termmal 
methionines fix)m polypeptides expressed with them. For example, certain hosts and 
5 fennentation conditions permit removal of substantially all of the N-termmal 
methionine in vivo. Other hosts require in vitro removal of the N-terminal methionine. 
Such in vitro and in vivo methods are well known in the art. 

The proteins produced by a transformed host can be purified according to any 
suitable method. Such standard methods include chromatography (e.g., ion exchange, 

10 affinity, and sizing column chromatography), centrifugation, differential solubility, or 
by any other standard technique for protein purification. For immunoaffinity 
chromatography (See Example 1), a protein such as Sonic hedgehog may be isolated by 
binding it to an affinity column comprising of antibodies that were raised against Sonic 
hedgehog, or a related protein and were affixed to a stationary support. Alternatively, 

15 affinity tags such as hexahistidine, maltose binding domain, influenza coat sequence, 
and glutathione-S-transferase can be attached to the protein to allow easy purification 
by passage over an appropriate affinity column. Isolated proteins can also be 
characterized physically using such techniques as proteolysis, nuclear magnetic 
resonance, and X-ray crystallography. 

20 

A. Production of Fragments and Analogs 

Fragments of an isolated protein (e.g., fragments of SEQ ID NOS: 1-4) can also be 
produced efficiently by recombinant methods, by proteolytic digestion, or by chemical 
synthesis using methods known to those of skill in the art. In recombinant methods, 

25 internal or terminal fragments of a polypeptide can be generated by removing one or 
more nucleotides from one end (for a terminal fragment) or both ends (for an internal 
fragment) of a DNA sequence which encodes for the isolated hedgehog polypeptide. 
Expression of the mutagenized DNA produces polypeptide fragments. Digestion with 
"end nibbling" endonucleases can also generate DNAs which encode an anray of 

30 fragments, DNAs which encode fragments of a protein can also be generated by 
random shearing, restriction digestion, or a combination or both. Protein fragments can 
be generated directly from intact proteins. Peptides can be cleaved specifically by 
proteolytic enzymes, including, but not limited to plasmin, thrombin, trypsin. 
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chymotrypsin, or pepsin. Each of these enzymes is specific for the type of peptide bond 
it attacks. Trypsin catalyzes the hydrolysis of peptide bonds in which the carbonyl 
group is from a basic amino acid, usually arginine or lysine. Pepsin and chymotrypsin 
catalyse the hydrolysis of peptide bonds from aromatic amino acids, such as 
5 tryptophan, tyrosine, and phenylalanine. Alternative sets of cleaved protein fragments 
are generated by preventing cleavage at a site which is suceptible to a proteolytic 
enzyme. For instance, reaction of the e-amino acid group of lysine with 
ethyltrifluorothioacetate in mildly basic solution yields blocked amino acid residues 
whose adjacent peptide bond is no longer susceptible to hydrolysis by trypsin. Proteins 

10 can be modified to create peptide linkages that are susceptible to proteolytic enzymes. 
For instance, alkylation of cysteine residues with P-haloethylamines yields peptide 
linkages that are hydrolyzed by trypsin (51). In addition, chemical reagents that cleave 
peptide chains at specific residues can be used. For example, cyanogen bromide cleaves 
peptides at methionine residues (52). Thus, by treating proteins with various 

1 5 combinations of modifiers, proteolytic enzymes and/or chemical reagents, the proteins 
may be divided into fragments of a desired length with no overlap of the fragments, or 
divided into overlapping firagments of a desired length. 

Fragments can also be synthesized chemically using techniques known in the art 
such as the Merrifield solid phase F moc or t-Boc chemistry. Merrifield, Recent 
20 Progress in Hormone Research 23: 451 (1967) 

Examples of prior art methods which allow production and testing of fragments 
and analogs are discussed below. These, or analogous methods may be used to make 
and screen fragments and analogs of an isolated polypeptide (e.g., hedgehog) which can 
be shown to have biological activity. An exemplary method to test whether fragments 
25 and analogs of hedgehog have biological activity is found in Example 3. 

B. Production of Altered DNA and Peptide Sequences: Random Methods 

Amino acid sequence variants of a protem (such as variants of SEQ ID NOS: 1-4) 
can be prepared by random mutagenesis of DNA which encodes the protem or a 
30 particular portion thereof. Useful methods include PGR mutagenesis and saturation 
mutagenesis. A library of random amino acid sequence variants can also be generated 
by the synthesis of a set of degenerate oligonucleotide sequences. Methods of 
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generating amino acid sequence variants of a given protein using altered DNA and 
peptides are well-known in the art. The following examples of such methods are not 
intended to limit the scope of the present invention, but merely serve to illustrate 
representative techniques. Persons having ordinary skill in the art will recognize that 
5 other methods are also useful in this regard. 

PGR Mutagenesis : Briefly, Taq polymerase (or another polymerase) is used to 
introduce random mutations into a cloned fragment of DNA (42). PGR conditions are 
chosen so that the fidelity of DNA synthesis is reduced by Taq DNA polymers using, 
1 0 for instance, a dGTP/dATP ratio of five and adding Mn^* to the PGR reaction. The pool 
of amplified iDNA fi:agments is inserted into appropriate cloning vectors to provide 
random mutant libraries. 

Saturation Mutagenesis: One mediod is described generally in (43). Briefly, the 
15 technique includes generation of mutations by chemical treatment or irradiation of 
single stranded DNA in vitro, and synthesis of a cDNA strand. The mutation frequency 
is modulated by the severity of the treatment and essentially all possible base 
substitutions can be obtained. 

20 Degenerate Oligonucleoride Mutagenesis : A library of homologous peptides can be 
generated from a set of degenerate oligonucleotide sequences. Ghemical synthesis of 
degenerate sequences can by performed in an automatic DNA synthesizer, and the 
synthetic genes are then ligated into an appropriate expression vector. See for example 
(44, 45) and Itakura et al.. Recombinant DNA, Proc. 3rd Gleveland Symposium on 

25 Macromolecules, pp. 273-289 (A.G. Walton, ed,), Elsevier, Amsterdam, 1981 . 

C, Production of Altered DNA and Peptide Sequences: Directed Methods 

Non-random, or directed, mutagenesis provides specific sequences or mutations in 
specific portions of a polynucleotide sequence that encodes an isolated polypeptide, to 
30 provide variants which include deletions, insertions, or substitutions of residues of the 
known amino acid sequence of the isolated polypeptide. The mutation sites may be 
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modified individually or in series, for instance by: (1) substituting first with conserved 
amino acids and then with more radical choices depending on the results achieved; (2) 
deleting the target residue; or (3) inserting residues of the same or a different class 
adjacent to the located site, or combinations of options 1-3, 

5 Clearly, such site-directed methods are one way in which an N-terminal cysteine 

(or a functional equivalent) can be introduced into a given polypeptide sequence to 
provide the attachment site for a hydrophobic moiety. 

Alanine scanning Mutagenesis: This method locates those residues or regions of a 
1 0 desired protein that are preferred locations for mutagenesis (46). In alanine screening, a 
residue or group of target residues are selected and replaced by alanine. This 
replacement can affect the interaction of the amino acid with neighboring amino acids 
and/or with the surrounding aqueous or membrane environment. Those having 
functional sensitivity to the substitutions are then refined by introducing fiuther or other 
1 5 variants at, or for, the sites of substitution. 

Oil gonucleotide-Mediated Mutagenesis: One version of this method may be used to 
prepare substitution, deletion, and insertion variants of DNA (47). Briefly, the deshred 
DNA is altered by hybridizing an oligonucleotide primer encoding a DNA mutation to 

20 a DNA template which typically is the single stranded fomi of a plasmid or phage 
containing the unaltered or wild type DNA sequence template of the desired protein 
(e.g., the Hedgehog protein). After hybridization, a DNA polymerase is used to make 
the second and complementary strand of DNA of the template that will incorporate the 
oligonucleotide primer, and will code for the selected alteration in the desired DNA 

25 sequence. Generally, oligonucleotides of at least 25 nucleotides in length are used. An 
optimal oligonucleotide will have 12 to 15 nucleotides that are completely 
complementary to the template on either side of the mutation. This ensures that the 
oHgonucleotide will hybridize properly to the single-stranded DNA template molecule. 

Cassette Mutagenesis: This method (48) requires a plasmid or other vector that 
30 contains the protein subunit DNA to be mutated. The codon(s) in the protein subunit 
DNA are identified and there is inserted a unique restriction endonuclease site on each 
side of the identified mutation site(s), using the above-described oligonucleotide- 
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directed mutagenesis method. The plasmid is then cut at these sites to linearize it. A 
double-stranded oligonucleotide encoding the sequence of the DNA between the 
restriction sites but containing the desired mutation(s) is synthesized using standard 
procedures. The two strands are synthesized separately and then hybridized together 
5 using standard methods. This double-stranded oligonucleotide is the "cassette" and it 
has 3* and 5' ends that are compatible with the ends of the linearized plasmid so that it 
can be directly ligated therein. The plasmid now contains the mutated desired protein 
subunit DNA sequence. 



10 Combinatorial Mutagenesis: In one version of this method (Ladner et al., WO 
88/06630), thet amino acid sequences for a group of homologs or other related proteins 
are aligned, preferably to promote the highest homology possible. All of the amino 
acids which appear at a given position of the aligned sequences can be selected to create 
a degenerate set of combinatorial sequences. The variegated library is generated by 

1 5 combinatorial mutagenesis at the nucleic acid level, and is encoded by a variegated 
gene library. For instance, a mixture of synthetic oligonucleotides can be ligated 
enzymically into the gene sequence such that the degenerate set of potential sequences 
are expressible as individual peptides, or alternatively, as a set of protems containing 
the entire set of degenerate sequences. 

20 

D. Other Variants of Isolated Polypeptides 

Included in the invention are isolated molecules that are: allelic variants, natural 
mutants, induced mutants, and proteins encoded by DNA that hybridizes under high or 
low stringency conditions to a nucleic acid which encodes a polypeptide such as the N- 
25 terminal fragment of Sonic hedgehog (SEQ ID NO: 1) and polypeptides bound 
specifically by antisera to hedgehog peptides, especially by antisera to an active site or 
binding site of hedgehog. All variants described herein are expected to: (i) retain the 
biological function of the original protein and (ii) retain the ability to link to a 
hydrophobic moiety (e.g, a lipid). 

30 The methods of the invention also feature uses of fragments, preferably 

biologically active fragments, or analogs of an isolated peptide such as hedgehog. 
Specifically, a biologically active fragment or analog is one having any in vivo or in 
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vitro activity which is characteristic of the peptide shown in SEQ ID NOS: 1-4 or of 
other naturally occurring isolated hedgehog. Most preferably, the hydrophobicaliy- 
modified fragment or analog has at least 10%, preferably 40% or greater, or most 
preferably at least 90% of the activity of Sonic hedgehog (See Example 3) in any in 
5 vivo or in vitro assay. 

Analogs can differ from naturally occurring isolated protein in amino acid 
sequence or in ways that do not involve sequence, or both. The most preferred 
polypeptides of the invention have prefened non-sequence modifications that include in 
vivo or in vitro chemical derivatization (e.g., of their N-temiinal end), as well as 
10 possible changes in acetylation, methylation, phosphorylation, amidation, 
carboxylation, or glycosylation. 

Other analogs include a protem such as Sonic hedgehog or its biologically 
active fragments whose sequences differ from the wild type consensus sequence (e.g., 
SEQ ID NO: 4) by one or more conservative amino acid substitutions or by one or 

15 more non conservative amino acid substitutions, or by deletions or insertions which do 
not abolish the isolated protein's biological activity. Conservative substitutions 
typically include the substitution of one amino acid for another with similar 
characteristics such as substitutions within the following groups: valine, alanine and 
glycine; leucine and isoleucine; aspartic acid and glutamic acid; asparagine and 

20 glutamine; serine and threonine; lysine and arginine; and phenylalanine and tyrosine. 
The non-polar hydrophobic amino acids include alanine, leucine, isoleucine, valine, 
prolme, phenylalanine, tryptophan, and methionine. The polar neutral amino acids 
mclude glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine. The 
positively charged (basic) amino acids include arginine, lysine, and histidine. The 

25 negatively charged (acidic) ammo acids include aspartic acid and glutamic acid. Other 
conservative substitutions can be readily known by workers of ordinary skill. For 
example, for the amino acid alanine, a conservative substitution can be taken from any 
one of D-alanine, glycine, beta-alanine, L-cysteine, and D-cysteine. For lysine, a 
replacement can be any one of D-lysine, arginine, D-arginine, homo-arginine. 

30 methionine, D-methionine, ornithine, or D-omithine. 

Generally, substitutions that may be expected to induce changes in the 
functional properties of isolated polypeptides are those in which: (i) a polar residue, 
e.g., serine or threonine, is substituted for (or by) a hydrophobic residue, e.g., leucine. 
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isoleucine, phenylalanine, or alanine; (ii) a cysteine residue is substituted for (or by) 
any other residue (See Example 10); (iii) a residue having an electropositive side chain, 
e.g., lysine, arginine or histidine, is substituted for (or by) a residue having an 
electronegative side chain, e.g., glutamic acid or aspartic acid; or (iv) a residue having a 
5 bulky side chain, e.g., phenylalanine, is substituted for (or by) one not having such a 
side chain, e.g., glycine. 

Other analogs used within the methods of the invention are those with 
modifications which increase peptide stability. Such analogs may contain, for example, 
one or more non-peptide bonds (which replace the peptide bonds) in the peptide 
10 sequence. Also included are: analogs that include residues other than naturally 
occurring L-amino acids, such as D-amino acids or non-naturally occurring or synthetic 
amino acids such as beta or gamma amino acids and cyclic analogs. Incorporation of 
D- instead of L-amino acids into the isolated hedgehog polypeptide may increase its 
resistance to proteases. See, U.S. Patent 5,219,990 supra, 

1 5 The term "fragment", as applied to an isolated hedgehog analog, can be as small 

as a single amino acid provided that it retains biological activity. It may be at least 
about 20 residues, more typically at least about 40 residues, preferably at least about 60 
residues in length. Fragments can be generated by methods known to those skilled in 
the art. The ability of a candidate fragment to exhibit isolated hedgehog biological 

20 activity can be also assessed by methods known to those skilled in the art as described 
herein. 

V. Making Hydrophobic Derivatives 

The inventors have discovered that increasing the overall hydrophobic nature of 
25 a signaling protein, such as a hedgehog protein, increases the biological activity of the 
protein. The potency of a signaling protein such as hedgehog can be Increased by; (a) 
chemically modifying, such as by adding a hydrophobic moiety to, the sulfhydryl 
and/or to the a-amine of the N-tenninal cysteine (Examples 8 and 9); (b) replacing the 
N-terminal cysteine with a hydrophobic amino acid (Example 10); or (c) replacing the 
30 N-temiinal cysteine with a different amino acid and then chemically modifying the 
substituted residue so as to add a hydrophobic moiety at the site of the substitution. 
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Additionally, modification of a protein such as hedgehog protein at an internal 
residue on the surface of the protein with a hydrophobic moiety by: (a) replacing the 
internal residue with a hydrophobic amino acid; or (b) replacing the internal residue 
with a different amino acid and then chemically modifying the substituted residue so as 
5 to add a hydrophobic moiety at the site of the substitution (See Example 1 0), will retain 
or enhance the biological activity of the protein. 

Additionally, modification of a protein such as a hedgehog protein at the C- 
terminus with a hydrophobic moiety by: (a) replacing the C-terminal residue with a 
hydrophobic amino acid; or (b) replacing the C-termuial residue with a different amino 
10 acid and then chemically modifying the substituted residue so as to add a hydrophobic 
moiety at the site of the substitution, will retain or enhance the biological activity of the 
protein. 

There are a wide range of lipophilic moieties with which hedgehog polypeptides 
can be derivatived. A lipophilic group can be, for example, a relatively long chain alkyl 

15 or cycloalkyl (preferably n-alkyl) group having approximately 7 to 30 carbons. The 
alkyl group may terminate with a hydroxy or primary amine "tail". To further 
illustrate, lipophilic molecules include naturally-occurring and synthetic aromatic and 
non-aromatic jtnoieties such as fatty acids, esters and alcohols, other lipid molecules, 
cage structures such as adamantane and buckminsterfullerenes, and aromatic 

20 hydrocarbons such as benzene, peiylene, phenanthrene, anthracene, naphthalene, 
pyrene, chrysene, and naphthacene. 

Particularly useful as lipophilic molecules are alicyclic hydrocarbons, saturated 
and unsaturated fatty acids and other lipid and phospholipid moieties, waxes, 
cholesterol, isoprenoids, terpenes and polyalicyclic hydrocarbons including adamantane 

25 and buckminsterfullerenes, vitamins, polyethylene glycol or oligoethylene glycol, (Cl- 
C18).alkyl phosphate diesters, -0-CH2.CH(OH)-0-(C12-C18)-alkyl, and in particular 
conjugates witii pyrene derivatives. The lipophilic moiety can be a lipophilic dye 
suitable for use in the invention include, but are not limited to, diphenylhexatriene, Nile 
Red, N-phenyl-l-naphtiiylamine, Prodan, Laurodan, Pyrene, Perylene, rhodamine, 

30 rhodamine B, tetramethylrhodamine, Texas Red, sulforhodamine, l,r-didodecyl- 
3,3,3',3*tetramethylindocarbocyanine perchlorate, octadecyl rhodamine B and the 
BODIPY dyes available from Molecular Probes Inc. 
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Other exemplary lipophilic moietites include aliphatic carbonyi radical groups 
include 1- or 2-adamantylacetyl, 3-methyladamanM-ylaceityl, 3-methyl-3-bromo-l- 
adamantylacetyl, 1-decaIinacetyl, camphoracetyl, camphaneacetyl, noradamantylacetyl, 
norbomaneacetyl, bicyclo[2.2.2.]-oct-5-eneacety 1, 1 -methoxybicyclo[2.2.2.]-oct-5-ene- 
5 2"Carbonyl, cis-5-norbomene-endo-2,3"dicarbonyl, 5-norbomen-2-ylacetyl, (lR)-( - )- 
myrtentaneacetyl, 2-norbomaneacetyI, anti-3-oxo-tricyclo[2.2.1.0<2,6> ]-heptane-7- 
carbonyi, decanoyl, dodecanoyl, dodecenoyl, tetradecadienoyl, decynoyl or 
dodecynoyl. 

Structures of exemplary hydrophobic modifications are shown in Figure 12. If 
10 an appropriate amino acid is not available at a specific position, site-directed 
mutagenesis can be used to place a reactive amino acid at that site. Reactive amino 
acids include cysteine, lysine, histidine, aspartic acid, glutamic acid, serine, threonine, 
tyrosine, arginine, methionine, and tryptophan. Mutagenesis could be used to place the 
reactive amino acid at the N- or C-terminus or at an internal position. 

1 5 For example, we have discovered that it is possible to chemically modify an N- 

terminal cysteine of a biologically active protein, such as a hedgehog protein, or 
eliminate the N-terminal cysteine altogether and still retain the protein's biological 
activity, provided that the modified or substituted chemical moiety is hydrophobic. The 
inventors have found that enhancement of hedgehog's biological activity roughly 

20 correlates with the hydrophobicity of the modification. In addition to enhancing the 
protein's activity, modifying or replacing the N-terminal cysteine eliminates unwanted 
cross reactions and/or modifications of the cysteine that can occur during production, 
purification, formulation, and storage of the protein. The thiol of an N-tennmal 
cysteine is very reactive due to its proximity to the a-amine which lowers the pKa of 

25 the cysteine and increases proton dissociation and formation of the reactive thiolate ion 
at neutral or acid pH. 

We have demonstrated that replacement of the N-terminal cysteine of hedgehog 
with a hydrophobic amino acid results ui a protein with increased potency in a cell- 
based signaling assay. By replacing the cysteine, this approach eliminates the problem 
30 of suppressing other unwanted modifications of the cysteine that can occur during the 
production, purification, formulation, and storage of the protein. The generality of this 
approach is supported by our finding that three different hydrophobic amino acids, 
phenylalanine, isoleucine, and methionme, each give a more active form of hedgehog. 
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Therefore, replacement of the cysteine with any other hydrophobic amino acid should 
result in an active protein. Furthermore, since we have found a correlation between the 
hydrophobicity of an amino acid or chemical modification and the potency of the 
corresponding modified protein in the C3H10TI/2 assay (e.g. Phe > Met, long chain 
5 length fatty acids > short chain length), it could be envisioned that adding more than 
one hydrophobic amino acid to the hedgehog sequence would increase the potency of 
the protein beyond that achieved with a single amino acid addition. Indeed, addition of 
two consecutive isoleucine residues to the N-terminus of human Sonic hedgehog results 
in an increase in potency in the C3H10T1/2 assay as compared to the mutant with only 

10 a single isoleucine added (See Example 10). Thus, adding hydrophobic amino acids at 
the N- or C-terminus of a hedgehog protein, in a surface loop, or some combination of 
positions would be expected to give a more active form of the protein. The substituted 
amino acid need not be one of the 20 common amino acids. Methods have been 
reported for substituting unnatural amino acids at specific sites in proteins (78, 79) and 

15 this would be advantageous if the amino acid was more hydrophobic in character, 
resistant to proteolytic attack, or could be used to further direct the hedgehog protein to 
a particular site in vivo that would make its activity more potent or specific. Unnatural 
amino acids cdn be incorporated at specific sites in proteins during in vitro translation, 
and progress is being reported in creating in vivo systems that will allow larger scale 

20 production of such modified proteins. 

it is unexpected that a protein, such as an hedgehog protein, modified according 
to the invention, would retain its biological activity. First, the N-terminal cysteine is 
conserved in all known hedgehog protein sequences including fish, frog, insect, bird, 
and mammals. Therefore, it is reasonable to expect that the fi^e sulfliydryl of the N- 
25 tenninal cysteine is important to the protein's structure or activity. Second, hedgehog 
proteins lacking an N-terminal cysteine, due to proteolytic cleavage or mutation to 
hydrophilic amino acids (e.g., aspartic acid or histidine) are inactive in a the cell-based 
C3H10T1/2 assay, such as that described in Example 3. 

There are many modifications of the N-terminal cysteine which protect the thiol 
30 and append a hydrophobic moiety. These modifications are discussed in more detail 
below. One 6f skill in the art is capable of determining which modification is most 
appropriate for a particular therapeutic use. Factors affecting such a determination 
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include cost and ease of production, purification and formulation, solubility, stability, 
potency, pharmacodynamics and kinetics, safety, immunogenicity, and tissue targeting. 

A. Chemical Modifications of Primary Amino Acid Sequence 

5 The chemical modification of the N-terminal cysteine to protect the thiol, with 

concomitant activation by a hydrophobic moiety, can be carried out in numerous ways 
by someone skilled in the art. The sulfliydryl moiety, with the thiolate ion as the active 
species, is the most reactive functional group in a protein. There are many reagents that 
react faster with the thiol than any other groups. See Chemistry of Protein Conjugation 

10 and Cross-Linking (S. S. Wong, CRC Press, Boca Raton, FL, 1991). The thiol of an 
N-terminal cysteine, such as found in all hedgehog proteins, would be expected to be 
more reactive than internal cysteines within the sequence. This is because the close 
proximity to the a-amine will lower the pKa of the thiol resulting in a greater degree of 
proton dissociation to the reactive thiolate ion at neutral or acid pH. In addition, the 

15 cysteine at the N-terminus of the structure is more likely to be exposed than the other 
two cysteines in the hedgehog sequence that are found buried in the protein structure. 
We have shown that the N-terminal cysteine is the only amino acid modified after a 1 h 
reaction with A^-ethyhnaleimide at pH 5.5 (See Example 9), and after a 18 h reaction 
with A^isopropyliodoacetamide at pH 7.0 (See Example 9). Other examples of such 

20 methods would be reaction with other a-haloacetyl compounds, organomercurials, 
disulfide reagents, and other A/-substituted maleimides. Numerous hydrophobic 
derivatives of these active species are available commercially (e.g., ethyl iodoacetate 
(Aldrich, Milwaukee WI), phenyl disulfide (Aldrich), and M-pyrenemaleimide 
(Molecular Probes, Eugene OR)) or could be synthesized readily (e.g., A^- 

25 alkyliodoacetamides (84), iV-alkyhnaleimides (85), and organomercurials (86). We 
have shown that the N-termmal cysteine of human Sonic hedgehog can be specifically 
modified with JV-isopropyliodoacetamide and that the hydrophobically-modified protem 
is 2-fold more potent in the C3H10T1/2 assay than the unmodified protein (See 
Example 9). It is expected that modification of Shh with a long-chain alkyl 

30 iodoacetamide derivative will result in a modified protein with even greater 
enhancement of potency. Such A^-alkyliodoacetamides can be synthesized readily by 
ones skilled in the art, using commercially available starting materials. 
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Another aspect to the reactivity of an N-terminal cysteine is that it can take part 
in reaction chemistries unique to its 1,2-aminothiol configuration. One example is the 
reaction with* thioester groups to form an N-terminal amide group via a rapid S to N 
shift of the thioester. This reaction chemistry can couple together synthetic peptides 
5 and can be used to add single or multiple, natural or unnatural, amino acids or other 
hydrophobic groups via the appropriately activated peptide. Another example, 
demonstrated herein, is the reaction with aldehydes to form the thiazolidine adduct. 
Numerous hydrophobic derivatives of thiol esters (e.g., C2-C24 saturated and 
unsaturated fatty acyl Coenzyme A esters (Sigma Chemical Co., St. Louis MO)), 

10 aldehydes (e.g., butyraldehyde, n-decyl aldehyde, and n-myristyl aldehyde (Aldrich)), 
and ketones (e.g., 2-, 3-, and 4-decanone (Aldrich)) are available commercially or could 
be synthesized readily (87, 88). In a similar manner, thiomorpholine derivatives 
exemplified by the l-bromo-2-butanone chemistry described in Example 9 could be 
prepared from a variety of a-haloketone starting materials (88). Because of the ease of 

15 finding alternative routes to modifying the thiol of the N-teraiinal cysteine, or any 
cysteme in a protein, we do not wish to be bound by the specific examples 
demonstrated'here. 

The a-amine of a protein can be modified preferentially relative to other amines 
in a protein because its lower pKa results in higher amounts of the reactive 

20 unprotonated form at neutral or acidic pH. We have shown that modification of the N- 
temiinal amine with a long chain fatty amide group, while maintaining a free cysteine 
thiol group, activates the hedgehog protein by as much as two orders of magnitude (See 
Example 8). Therefore chemistries that can be directed to react preferentially with the 
N-terminal amine would be expected to be of use in increasing the potency of the 

25 hedgehog proteins. Aryl halides, aldehydes and ketones, acid anhydrides, isocyanates, 
isothiocyanates, imidoesters, acid halides, A^-hydroxysuccinimidyl (e.g., sulfo-NHS- 
acetate), nitrophenyl esters, acylimidazoles, and other activated esters are among those 
known to react with amine fiinctions. 

By replacing the N-terminal cysteine of hedgehog with certain other amino 
30 acids, other chemical methods can be used to add a hydrophobic moiety to the N- 
terminus. One example is to place a serine or threonine at the N-terminus, oxidize this 
amino acid to form an aldehyde, and then conjugate the protein with a chemical moiety 
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containing a 1^ aminothiol structure (e.g., a cysteine). A second example would be to 
place a histidine at the N-terminus to couple to a C-terminal thiocarboxylic acid. 

Chemical modification of other amino acids. 

5 There are specific chemical methods for the modification of many other amino 

acids. Therefore another route for synthesizing a more active-form of hedgehog would 
be to chemically attach a hydrophobic moiety to an amino acid in hedgehog other than 
to the N-terminal cysteine. If an appropriate amino acid is not available at the desired 
position, site-directed mutagenesis could be used to place the reactive amino acid at that 

10 site in the hedgehog structure, whether at the N- or C-terminus or at another position. 
Reactive amino acids would include cysteine, lysine, histidine, aspartic acid, glutamic 
acid, serine, threonme, tyrosine, arginine, methionine, and tryptophan. Thus the goal of 
creating a more hydrophobic form of hedgehog could be attained by many chemical 
means and we do not wish to be restricted by a particular chemistry or site of 

1 5 modification since our results support the generality of this approach. 

The !>-diiehog poi\ [vptiJj can be linked to the h;- v!!v>pl»ul>ij iwy^uiy in a number 
of ways mcluding by chemical coupling means, or by genetic engineering. 

To illustrate, there are a large number of chemical cross-linking agents that are 
known to those skilled in the art. For the present invention, the preferred cross-linking 

20 agents are heterobifimctional cross-linkers, which can be used to link the hedgehog 
polypeptide and hydrophobic moiety in a stepwise manner. Heterobifunctional cross- 
linkers provide the ability to design more specific coupling methods for conjugating to 
proteins, thereby reducmg the occurrences of unwanted side reactions such as homo- 
protein polymers. A wide variety of heterobifimctional cross-linkers are known in the 

25 art. These include: succinimidyl 4-(N-maleimidomethyl) cyclohexane- l-carboxylate 
(SMCC), m-Maleimidobenzoyl-N- hydroxysuccinimide ester (MBS); N-succmimidyl 
(4-iodoacetyl) aminobenzoate (SIAB), succinimidyl 4-(p-maleimidophenyl) butyrate 
(SMPB), l-ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride (EDC); 4- 
succmimidyloxycarbonyl- a-methyI-a-(2-pyridyldithio)4olune (SMPT), N-succinimidyl 

30 3-(2-pyridyldithio) propionate (SPDP), succinimidyl 6-[3.(2-pyridyldithio) propionate] 
hexanoate (LC-SPDP). Those cross-linking agents havmg N-hydroxysuccinimide 
moieties can be obtained as the N-hydroxysulfosuccinimide analogs, which generally 
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have greater water solubility. In addition, those cross-linking agents having disulfide 
bridges within the linking chain can be synthesized instead as the alkyl derivatives so as 
to reduce the amount of linker cleavage in vivo. 

In addition to the heterobifunctional cross-linkers, there exists a number of other 
5 cross-linking agents including homobifunctional and photoreactive cross-linkers. 
Disuccinimidyl suberate (DSS), bismaleimidohexane (BMH) and 
dimethylpmielimidate-2 HCl (DMP) are examples of useful homobifunctional cross- 
linking agents, and bis-[B-(4-azidosalicylamido)ethyl]disulfide (BASED) and N- 
succinimidyl-6(4*-azido-2 -nitrophenyl- amino)hexanoate (SANPAH) are examples of 
10 useful photoreactive cross-linkers for use in this invention. For a recent review of 
protein coupling techniques, see Means et al. (1990) Bioconjugate Chemistry 1:2-12, 
incorporated by reference herein. 

One particularly useful class of heterobifunctional cross-linkers, included above, 
contain the primary amine reactive group, N-hydroxysuccinimide (NHS), or its water 
15 soluble analog N-hydroxysulfosuccinimide (sulfo-NHS). Primary amines (lysine 
epsilon groups) at alkaline pH*s are unprotonated and react by nucleophilic attack on 
NHS or sulfo-NHS esters. This reaction results in the formation of an amide bond, and 
release of NHS or sulfo-NHS as a by-product 

Another reactive group useful as part of a heterobifunctional cross-linker is a 
20 thiol reactive group. Common thiol reactive groups include maleimides, halogens, and 
pyridyl disulfides. Maleimides react specifically with free sulfhydryls (cysteine 
residues) in minutes, under slightly acidic to neutral (pH 6.5-7.5) conditions. Halogens 
(iodoacetyl functions) react with -SH groups at physiological pH's. Both of these 
reactive groups result in the formation of stable thioelher bonds. 

25 The third component of the heterobifunctional cross-linker is the spacer arm or 

bridge. The bridge is the structure that connects the two reactive ends. The most 
apparent attribute of the bridge is its effect on steric hindrance. In some instances, a 
longer bridge can more easily span the distance necessary to link two complex 
biomolecules. For instance, SMPB has a span of 14.5 angstroms. 

30 Preparing protein-protein conjugates using heterobifunctional reagents is a two- 

step process involving the amine reaction and the sulfhydryl reaction. For the first step, 
the amine reaction, the protein chosen should contain a primary amine. This can be 
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lysine epsilon amines or a primary alpha amine found at the N-terminus of most 
proteins. The protein should not contain fr^ sulfhydryl groups. In cases where both 
proteins to be conjugated contain free sulfhydryl groups, one protein can be modified so 
that all sulfhydryls are. blocked using for instance, N-ethyhnaleimide (see Partis et al. 
5 (1 983) J. Pro. Chem. 2:263, mcorporated by reference herein). Ellman's Reagent can be 
used to calculate the quantity of sulfhydryls in a particular protein (see for example 
EUman et al.?(1958) Arch. Biochem. Biophys. 74:443 and Riddles et al. (1979) Anal. 
Biochem. 94:75, incorporated by reference herein). 

The reaction buffer should be free of extraneous amines and sulfhydryls. The 
10 pH of the reaction buffer should be 7.0-7.5. This pH range prevents maleimide groups 
from reacting with amines, preserving the maleimide group for the second reaction with 
sulfhydryls. 

The NHS-ester containing cross-linkers have limited water solubility. They 
should be dissolved in a minimal amount of organic solvent (DMF or DMSO) before 
15 introducing the cross-linker into the reaction mixture. The cross-linker/solvent forms 
an emulsion which will allow the reaction to occur. 

The sulfo-NHS ester analogs are more water soluble, and can be added directly 
to the reaction buffer. Buffers of high ionic strength should be avoided, as they have a 
tendency to "salt out" the sulfo-NHS esters. To avoid loss of reactivity due to 
20 hydrolysis, the cross-Unker is added to the reaction mixture immediately after 
dissolving the protein solution. 

The reactions can be more eflBcient in concentrated protein solutions. The more 
alkaline the pH of the reaction mixture, the faster the rate of reaction. The rate of 
hydrolysis of the NHS and sulfo-NHS esters will also mcrease with increasing pH. 
25 Higher temperatures will increase the reaction rates for both hydrolysis and acylation. 

Once the reaction is completed, the first protein is now activated, with a 
sulfhydryl reactive moiety. The activated protein may be isolated from the reaction 
mixture by simple gel filtration or dialysis. To carry out the second step of the cross- 
linking, the sulfhydryl reaction, the lipophilic group chosen for reaction with 
30 maleunides, activated halogens, or pyridyl disulfides must contain a free sulfhydryl. 
Alternatively, a primary amine may be modified with to add a sulfhydryl 
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In all cases, the buffer should be degassed to prevent oxidation of sulfhydryl 
groups. EDTA may be added to chelate any oxidizing metals that may be present in the 
buffer. Buffers should be free of any sulfhydryl containing compounds. 

Maleimides react specifically with -SH groups at slightly acidic to neutral pH 
5 ranges (6.5-7.5). A neutral pH is sufficient for reactions involving halogens and pyridyl 
disulfides. Under these conditions, maleimides generally react with -SH groups within 
a matter of minutes. Longer reaction times are required for halogens and pyridyl 
disulfides. 

The fu^t sulfhydryl reactive-protein prepared in the amine reaction step is 
10 mixed with the sulfhydiyl-containing lipophilic group under the appropriate buffer 
conditions. The conjugates can be isolated from the reaction mixture by methods such 
as gel filtration or by dialysis. 

Exemplary activated lipophilic moieties for conjugation include: N-(l- 
pyrene)malemxide; 2,5-dimethoxystilbene-4'-maleimide, eosin-5-maleimide; 

1 5 fluorescein-5-maleimide; N-(4-(6-dimethylamino- 2-benzofuranyl)phenyl)maleimide; 
benzophenone-4-maleimide; 4-dimethylaminophenyla2ophenyl- 4-maIeimide 
(DABMI), tetramethylrhodamine-5-maleimide, tctramethylrhodamine-6-maleimide, 
Rhodamine RedTM C2 maleimide, N-(5-aniinopentyl)maleimide, trifluoroacetic acid 
salt, N-(2-aminoethyl)maleimide, trifluoroacetic acid salt, Oregon GreenTM 488 

20 maleimide, N-(2-((2-(((4-azido- 2,3,5,6-tetrafluoro)benzoyl) 

amino)ethyl)dithio)ethyl)maleimide (TFPAM-SSl), 2-(l-(3-dimethyiaminopropyl) - 
indol-3-yl)-3-(indol-3-yl) maleimide (bisindolyhnaleimide; GF 109203X), B0DIPY<8) 
FL N-(2-aminoethyl)maleimide, N-(7-dimethylamino- 4-methyIcoumarin-3- 
yl)maleimide (DACM), AlexaTM 488 C5 maleimide, AlexaTM 594 C5 maleimide, 

25 sodium saltN-(l-pyrene)maleimide, 2,5-dimethoxystilbene-4 -maleimide, eosin-5- 
maleimide, fluorescein-5-maleimide, N-(4-(6-dimethyIamino- 2- 

benzofuranyl)phenyl)maleimide, benzophenone-4-maleimide, 4- 

dimethylaminophenylazophenyl- 4*-ma]eimide, l-(2-maleimidylethyl)-4-{5- (4- 
methoxyphenyI)oxazol-2- yl)pyridinium methanesulfonate, tetramethylrhodamine-5- 

30 maleimide, tetramethylrhodamine-6-maleimide, Rhodamine RedTM C2 maleimide, N- 
(5-aminopentyl)maleimide, N-(2-aminoethyl)maleimide, N-(2-((2-(((4-azido- 2,3,5,6- 
tetrafluoro)ben2oyl) amino)ethyl)dithio)ethyl)maleimide, 2-(l-(3- 
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dimethylaminopropyl) -indol-3-yI)-3-(indol-3-yi) maleimide, N-(7-dimethylamino- 4- 
methylcoumarin-3-yl)maleimide (DACM), llH-Benzo[a]fluorene» Ben2o[a]pyrene. 

In one embodiment, the hedgehog polypeptide can be derivatived using pyiene 
maleimide, which can be purchased from Molecular Probes (Eugene, Oreg.), e.g., N-(l- 
5 pyrene)nialeimide or l-pyrenemethyl iodoacetate (PMIA ester). As illustrated in 
Figure 1, the pyrene-derived hedgehog protein had an activity profile indicating that it 
was nearly 2 orders of magnitude more active than the unmodified form of the protein, 

B. Making Hydrophobic Peptide Derivatives 

10 According to the invention, the protein can also be modified using a 

hydrophobic peptide. As used herein, the term "peptide" includes a sequence of at least 
one amino acid residue. Preferably, the peptide has a length between one amino acid 
and 18-26 amino acids, the latter being the typical length of a membrane spanning 
segment of a protein. To create a peptide with hydrophobic character, the amino acids 

15 are selected predominantly from the following hydrophobic amino acids: 
phenylalanine, isoleucine, leucine, valine, methionine, tryptophan, alanine, proline, and 
tyrosine. The hydrophobic peptide can also contam unnatural amino acid analogs with 
hydrophobic character or D-amino acids, peptoid bonds, N-terminal acetylation or other 
features that decrease the peptide^s susceptibility to proteolysis. Methods for 

20 substituting unnatural amino acids at specific sites in proteins are known (78, 79). 

Generally, a hydrophobic peptide is appended to various sites on a protein. One 
site can be the^ N-terminal residue. Altematively, the hydrophobic peptide is substituted 
in place of the N-terminal residue. In another embodiment, a hydrophobic peptide is 
appended to the C-terminus of the protein. Altematively, the hydrophobic peptide is 

25 substituted in place of the C-terminal residue. The C-terminus can be the native C- 
terminal amino acid but it may also be the C-terminus of a truncated protein so that the 
hydrophobic peptide is appended to the final C-terminal amino acid of the truncated 
form, which is still referred to as the "C-terminus". A truncated hedgehog protein will 
retain activity when up to eleven amino acids are deleted from the native C-terminal 

30 sequence. The hydrophobic peptide may also be inserted between the N-terminal 
residue and the internal residue immediately adjacent to the N-terminal residue, or 
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between the C-terminal residue and the residue inunediately adjacent to the C-terminal 
residue, or between two internal residues. 

In certain embodiments, the lipophilic moiety is an amphipathic polypeptide, 
such as magainin. cecropin, attacin, melittin, gramicidin S, alpha-toxin of Staph. 
5 aureus, alamethicin or a synthetic amphipathic polypeptide. Fusogenic coat proteins 
from viral particles can also be a convenient source of amphipathic sequences for the 
subject hedgehog proteins 

C, Making Lipid Derivatives 

10 Another form of protein encompassed by the invention is a protein derivatized 

with a variety of lipid moieties. Generally, a "lipid" is a member of a heterogenous 
class of hydrophobic substances characterized by a variable solubility in organic 
solvents and insolubility, for the most part, in water. The principal classes of lipids that 
are encompassed within this invention are fatty acids and sterols (e.g., cholesterol). 

15 Derivatized proteins of the invention contain fatty acids which are cyclic, acyclic (i.e., 
straight chain), saturated or xmsaturated, mono-carboxylic acids. Exemplary saturated 
fatty acids have the generic formula: CHj (CH^n COOH. The following table lists 
examples of some fatty acids that can be derivatized conveniently using conventional 
chemical methods. 

20 

TABLE 2: Exemplary Saturated and Unsaturated Fatty Acids 
Saturated Acids: CHj (CH2)n COOH 

Value of n Common Name 

2 butyric acid 

25 4 caproic acid 

6 caprylic acid 

8 capric acid 

10 lauric acid 

12 myristic acid* 

30 14 palmitic acid* 

16 stearic acid* 

18 arachidic acid* 

20 behenic acid 

22 lignoceric acid 



i 
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Unsaturated Acids 



CH3CH=CHC00H 



crotonic acid 
myristoleic acid* 
paimitoleic acid* 
oleic acid* 
iinoleic acid . 
iinolenic acid 
arachidonic acid 



CH3(CH,)3CH=CH(CH2)7COOH 



5 CH3(CH2)5CH=CH (CH2)7COOH 
CH3(CH2)7CH=CH(CH2)7COOH 



CH3(CH2)3(CH2CH=CH)2(CH2)7COOH 



CH3(CH2CH=CH)3(CH2)7COOH 



CH3(CHj)3(CH2CH=CH)4(CH2)3COOH 



10 The asterisk (*) denotes the fatty acids that we found in recombinant hedgehog protein 
secreted from a soluble construct. 

Other lipids that can be attached to the protein include branched-chain fatty 
acids and those of the phospholipid group such as the phosphatidylinositols (i.e., 
15 phosphatidylinbsitol 4-monophosphate and phosphatidylinositol 4,5-biphosphate), 
phosphatidycholme, phosphatidylethanolamine, phosphatidylserine, and isoprenoids 
such as famesyl or geranyl groups. 

We have demonstrated that lipid-modified hedgehog proteins can be purified 
from either a natural source, or can be obtained by chemically modifying the soluble, 

20 unmodified protein. For protein purified firom a natural source, we showed that when 
full-length human Sonic hedgehog (Shh) was expressed in insect cells and membrane- 
bound Shh pxirified from the detergent-treated cells using a combination of SP- 
Sepharose chromatography and immunoaffinity chromatography, that the purified 
protein migrated on reducing SDS-PAGE gels as a single sharp band with an apparent 

25 mass of 20 kDa (See Example 1). The soluble and membrane-bound Shh proteins were 
readily distinguishable by reverse phase HPLC, where the tethered forms eluted later in 
the acetonitrile gradient (See Example 1 and Figure 3). We then demonstrated that 
human Sonic hedgehog is tethered to cell membranes in two forms, one form that 
contains a cholesterol, and therefore is analogous to the data reported previously for 

30 Drosophila hedgehog (18), and a second novel form that contains both a cholesterol 
and a palmitic acid modification. Soluble and tethered forms of Shh were analyzed by 
electrospray mass spectrometry using a triple quadrupole mass spectrometer, equipped 
with an electrospray ion source (Example 1) as well as by liquid chromatography-mass 
spectrometry (See Example 1). The identity of the N-terminal peptide fix)m 
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endoproteinase Lys-C digested tethered Shh was confirmed by MALDI PSD mass 
spectrometric measurement on a MALDI time of flight mass spectrometer. The site of 
palmitoylation was identified through a combination of peptide mapping and sequence 
analysis and is at the N-terminus of the protein (residue 1 of the sequence of the mature 
5 protein in SEQ ID NOS: 1-4). Both tethered forms were equally as active in the 
C3H10T1/2 alkaline phosphatase assay, but interestingly both were about 30-times 
more potent than soluble human Shh lacking the tether(s). The lipid modifications did 
not significantly affect the apparent binding affinity of Shh for its receptor, patched 
(Figure 7). 

10 We next tested the utility of the derivatized forms by assaying the relative 

potencies of soluble and tethered Shh alone or in the presence of the anti-hedgehog 
neutralizing Mab 5E1 on C3H10T1/2 cells measuring alkaline phosphatase induction. 
Moreover, the relative potency of soluble and tethered Shh for binding to patched was 
assessed onpa/cAerf-transfected EBNA-293 cells by FACS analysis (Example 3). 

15 For lipid-modified hedgehog obtained by chemically modifying the soluble, 

unmodified protein, we have showed that pahnitic acid and other lipids can be added to 
soluble Shh to create a lipid-modified forms with increased potency in the C3H10T1/2 
assay (Example 8). We have shown (Examples 1, 2, and 8) that the. thiol and a-amine 
on the N-terminal cysteine contribute to the lipid derivatization reaction. Without 

20 wishing to be bound by any particular theory, lipid modification on proteins starts with 
the formation of a thioester intermediate and the lipid moiety Is then transferred to the 
a-amine of the N-terminus through the formation of a cyclic intermediate. Generally, 
therefore, the reactive lipid moiety can be in the form of thioesters of saturated or 
unsaturated carboxylic acids such as a Coen2yme A thioesters. Such materials and 

25 their derivatives may include, for example, commercially available Coenzyme A 
derivatives such as palmitoleoyl Coenzyme A, arachidoyl Coenzyme A, arachidonoyl 
Coenzyme A, lauroyl Coenzyme A and the like. These materials are readily available 
fi-om Sigma Chemical Company (St. Louis, MO., 1998 catalog pp. 303-306). 

The effect of different lipid moieties on functional activity of hedgehog protein 
30 has been assayed (See Example 8 and Figures 10 and 11). Similarly, the effect of 
different lipid moieties on functional activity of other proteins such as those described 
above in Section III, may be conveniently tested using methods known to workers of 
ordinary skill. For instance, functional testing of gelsolin (50), various interferons 
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(interferon-a, interferon-p and interferon-y), the various interleukins (e.g., IL-1 , -2, -3, 
-4, and the like), the tumor necrosis factors-a and -p, and other growth factors that are 
lipid-modified according to the invention can be accomplished using well known 
methods. 

5 Although we have established chemical means by which a fatty acid can be 

attached to the N-terminal cysteine of hedgehog proteins, it might be expected that 
lipids can be attached to the same or other sites using enzymically catalyzed reactions. 
Palmitoylation of proteins irt vivo is catalyzed by a class of enzymes known as 
palmitoyl-CoA:protein S-palmitoyltransferases. Using purified enzymes, in vitro 

10 acylation of protein substrates has been demonstrated (80, 81). The substrate 
specificities of the palmitoyltransferase enzymes are not well defined; an analysis of 
palmitoylation sites of cellular and viral proteins finds little in the way of a consensus 
sequence surrounding the modified cysteine residue, but suggests a common presence 
of a lysine or arginine residue within two amino acids of the cysteine, and large, 

15 hydrophobic amino acids near the cysteine. The amino-terminal sequence of Shh, 
CGPGRGFG, may fit this consensus sequence and serve as a recognition site for 
palmitoylation. 

As an alternative, myristoylation of the amino terminus of hedgehog proteins 
could be carried out using an N-myristoyl transferase (NMT), a niunber of which have 

20 been well characterized in both mammals (82) and in yeast (83). A recognition site for 
N-myristoyltransferase could be engineered into the hedgehog N-terminal sequence to 
facilitate recognition by the en2yme. Both of these strategies would require the use of 
fatty acyl-coenzyme A derivatives as substrates, as are used for the non-enzymic fatty 
acylation of human Sonic hedgehog described in Example 8. Alternatively, a protein 

25 with an engmeered recognition sequence may be myristoylated during expression in a 
suitable cell line. Another method of modifying a protein such as hedgehog with a 
hydrophobic moiety is to create a recognition site for the addition of an isoprenoid 
group at the C-terminus of the protein. The recognition site for famesyl and geranyl- 
geranyl addition are known and the protein may be modified during expression in a 

30 eukaryotic cell (Gelb et al.. Cur. Opin. Chem. Biol, 2: 40-48 (1998)). 

VI. Multimeric Pr tein Complexes 
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Hydrophobically-modified proteins described herein are particularly amenable 
to being made into multimeric protein complexes. Multimeric protein complexes of the 
invention include proteins, optionally attached via their hydrophobic (e.g., lipid) 
moiety to a vesicle. The vesicle may be a naturally occurring biological membrane, 

5 purified away from natural material, or the vesicle may be a synthetic construction. 
Preferred vesicles are substantially spherical structures made of amphiphiles, e.g., 
surfactants or phospholipids. The lipids of these spherical vesicles are generally 
organized in the fomi of lipids having one or more structural layers, e.g., multilamellar 
vesicles (multiple onion-like shells of lipid bilayers which encompass an aqueous 

1 0 volxmie between the bilayers) or micelles. 

In particular, liposomes are small, spherical vesicles composed primarily of 
various types of lipids, phospholipids, and secondary lipophilic components. These 
components are normally arranged in a bilayer formation, similar to the lipid 
arrangement of biological membranes. 

1 5 Typically, the polar end of a component lipid or lipid-like molecule is in contact 

with the surrounding solution, usually an aqueous solution, while the non-polar, 
hydrophobic end of the lipid or lipid-like molecule is in contact with the non-polar, 
hydrophobic end of another hpid or lipid-like molecule. The resulting bilayer 
membrane (i.e., vesicle) is selectively permeable to molecules of a certain size, 

20 hydrophobicity, shape, and net charge. Most vesicles are lipid or lipid-like in nature, 
although alternative liposome bilayer formulations, comprising a surfactant with either 
a lipid or a cholesterol, exist 

Liposome vesicles may be particularly preferred in that they find many 
therapeutic, diagnostic, industrial, and commercial applications. They are used to 

25 deliver molecules which are not readily soluble in water, or when directed timed release 
is desired. Because of their selective peraneability to many chemical compounds, 
liposomes are useful as delivery vehicles for drugs and biological materials. Thus, 
lipid-derivatized proteins such as hedgehog can be made multimeric by being 
incorporated into the lipid bilayer of liposome vesicles. Upon reaching the target site, 

30 the liposomes may be degraded (for example, by enzymes in the gastro-intestinal tract) 
or they may fuse with the membranes of cells. 

Several methods of preparing vesicles such as liposomes are known. The 
production of phospholipid vesicles is well known (53). For a general review of 
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commonly used methods, see (54). Among die more common of these are (1) 
sonication of a solution containing lipids sometimes followed by 
evaporation/lyophilization and rehydration (see, e.g. Stryer, Biochemistry, pp. 290- 
291, Freeman & Co., New York, (1988), and (55); (2) homogenizaUon of a lipid 
5 solution, sometimes at high pressure or high shearing force (see e.g. U.S. Pat. No. 
4,743.449 issued 10 May 1988, and U.S. Pat. No. 4,753,788, issued 28 Jun. 1988), (3) 
hydration and sometimes sonication of a dried fihn of vesicle-forming lipids wherein 
the lipid fihn is prepared by evaporation of a solution of lipids dissolved in an organic 
solvent (see e.g. U.S. Pat. No. 4,452,747 issued 5 Jun. 1984, U.S. Pat. No. 4,895,719 

10 issued 23 Jan. 1990, and U.S. Pat. No. 4,946,787 issued 7 Aug. 1990), (4) 
lyophilization or evaporation and rehydration (see e.g. U.S. Pat. No. 4,897,355 issued 
30 Jan. 1990, rEP 267,050 published 5 Nov. 1988, U.S. Pat. No. 4,776,991 issued II 
Oct. 1988, EP 172,007 published 19 Feb. 1986, and Australian patent application AU- 
A-487 13/85 published 24 Apr. 1986), (5) solvent injection or infusion of a lipid 

15 solution into an aqueous medium or vice versa (see e.g. (56); U.S. Pat. No. 4,921,757 
issued 1 May 1990. U.S. Pat. No. 4,781,871 issued 1 Nov. 1988, WO 87/02396 
published 24 Mar. 1988, and U.S. Pat. No. 4,895,452 issued 23 Jan. 1990), (6) spray 
drying (see e.g. Australian patent application AU-A-48713/85 published 24 Apr. 1986, 
and U.S. Pat. No. 4,830,858 issued 16 May 1989), (7) filtration (see e.g. WO 

20 85/01161), (8) reverse-phase evaporation. See e.g. (57); and (9) combinations of the 
above methods. See e.g, (58) and (59). 

Preferred lipids and lipid-like components suitable for use in preparing vesicles 
include phospholipids, a mixture of phospholipids, polar lipids, neutral lipids, fatty 
acids, and their derivatives. A preferred lipid has the characteristic that when dispersed 

25 alone in water, at a temperature above tiie lipid transition temperature, they are in a 
lipid emulsion phase. In certain embodiments, the lipid is a single-aliphatic chain of 
greater than about 12 carbons and can be either saturated or unsaturated, or substituted 
in other ways. Suitable lipids include the ester, alcohol, and acid forms of the 
following fatty acids: stearate, oleic acid, linoleic acid, arachidate, arachidonic acid, and 

30 other single-aliphatic chains acids. Further candidates include the ester, alcohol, and 
acid forms of tiie retinols, in particular, retinol and retinoic acid. Other preferred lipids 
include phosphatidylcholine (PC), phosphatidylglycerol (PG) and their derivatives, 
created synthetically or derived fi*om a variety of natural sources. 
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In certain embodiments, the vesicle may be stabilized sterically by the 
incorporation of polyethylene glycol (PEG), or by the PEG headgroups of a synthetic 
phospholipid (PEG conjugated to distearoyl phosphatidylethanolamine (DSPE), see e.g. 
(61)). Preferred surfactants are those with good miscibility such as Tween™, Triton ™, 
5 sodium dodecyl sulfate (SDS), sodium laurel sulfate, or sodium octylglycoside. 

Preferred surfactants form micelles when added to aqueous solution above the 
surfactants phase transition temperature. The surfactants may be composed of one or 
more aliphatic chains. These aliphatic chains may be saturated, unsaturated, or 
substituted in other ways, such as by ethoxylation; typically the aliphatic chain contains 

10 greater than about 12 carbons. Additional suitable surfactants include the following: 
lauryl-, myristyl-, linoleyl-, or stearyl-sulfobetaine; lauryl-, myristyl-, Imoleyl- or 
stearyl-sarcosine; linoleyl, myristyl-, or cetyl- betaine; lauroamidopropyl-, 
cocamidopropyl-, linoleamidopropyl-, myristamidopropyl-, palmidopropyl-, or 
isostearamidopropyl-betaine (e.g, lauroamidopropyl); myristamidopropyl-, 

15 pahnidopropyl-, or isostearamidopropyl-dimethylamine; sodium methyl cocoyl-, or 
disodium methyl oleyl-taurate; and the MONAQUAT series (Mona Industries, Inc., 
Paterson, N.J.), See also Example 4. 

Preferred sterols and sterol esters suitable for use in preparing multimeric 
protein complexes include cholesterol, cholestanol, cholesterol sulfate, and other 

20 cholesterol analogs and derivatives. The fact that a vesicle may comprise many 
different lipids and detergents allows great flexibility in engineering a tethered protein- 
vesicle complex with desired properties. For example, one may produce vesicles that 
bind different number of proteins by varying the lipid composition of the starting 
materials to create larger vesicles, or by mcreasing the percentage of 

25 phosphatidylinositol lipids in the vesicle. 

VTL Utilities 

Generally, the modified proteins described herein are useful for treating the 
same medical conditions that can be treated with the unmodified forms of the proteins. 
30 However, the hydrophobically-modified proteins described herein provide several 
significant improvements over the unmodified forms. First, their increased potency 
enables treatnient with smaller amounts of protein and over shorter periods of time. 
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This will be important in both systemic and CNS applications. Secondly, replacement 
of the N-terminal cysteine with a less chemically reactive amino acid allows for easier 
production, formulation, and storage of a protein for clinical use. Thirdly, the 
pharmacodynamics of a protein will be altered by hydrophobic modification and this 
5 will allow the proteins to be localized in the vicinity of the site of administration, thus 
increasing their safety, by minimizing systematic exposure, and their effectiveness by 
increasing their local concentration. The proteins of the invention are also useful in 
diagnostic compositions and methods to detect their corresponding receptors. 

As an example of the first point, it has been found that the half-life of hedgehog 
10 is very short after systemic application and that multiple injections are required to 
achieve a robust response to the protein. The higher potency of the modified forms and 
the possibility of formulation in liposomes provides a means of achieving a response 
with fewer treatments. For CNS applications, the higher potency provides a means to 
supply an adequate amount of protein in the small volumes required for direct injection 
15 into the CNS. 

The importance of the second point is illustrated by the fact that we have found 
that the N-terminal cysteine of hedgehog is highly susceptible to chemical attack, either 
to form other chemical adducts or to oxidatively-dimerize with another hedgehog 
protein. To prevent this, special buffers and procedures are used during purification, 
20 and dithiothreitol is used in the final formulation. These precautions necessitate carefiil 
evaluation of the effects of the formulation buffer in animal models. 

As an example of the third point, the more limited the range over which a 
protein diffuses away fi:om the site of administration, the higher the local concentration. 
This higher local concentration may therefore allow more specific clinical responses 
25 during the treatment of neurological disorders after direct injection into the desired 
region of the brain or spinal cord. 

Similarly, the modified proteins can be administered locally to the site of bone 
fractures to help heal these fi:actures, in the gonads to treat fertility disorders, 
intraocularly to treat eye disorders, and under the skin to treat dermatological 
30 conditions, and to stimulate local hair growth. Localization of the hydrophobically- 
modified proteins to the site of -administration therefore reduces possibly undesirable 
systemic exposure to other tissues and organs. 
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For therapeutic use, hydrophobically-modified proteins of the invention are 
placed into pharmaceutically acceptable, sterile, isotonic formulations and optionally 
are administered by standard means well known in the field. The formulation is 
preferably liquid or may be lyophilized powder. It is envisioned that a therapeutic 
5 administration of, for instance, a multimeric protein complex may comprise liposomes 
incorporating the derivatized proteins described herein. 

It will be appreciated by persons having ordinary skill in the art that the 
particular administration, dosage, and clinical ^plications of a hydrophobically- 
modified protein of the invention will vary depending upon the particular protein and 
1 0 its biological activity. 

As but one example of the application of the proteins of this invention in a 
therapeutic context, therapeutic hydrophobically-modified hedgehog proteins can be 
admmistered to patients suffering from a variety of neurological conditions. The ability 
of hedgehog protein to regulate neuronal differentiation during development of the 

IS nervous system and also presumably in the adult state indicates that hydrophobically- 
modified hedgehog can reasonably be expected to facilitate control of adult neurons 
with regard to^.maintenance, fimctional performance, and aging of normal cells; repair 
and regeneration processes in lesioned cells; and prevention of degeneration and 
premature death which results fiom loss of differentiation in certain pathological 

20 conditions. In light of this, the present hydrophobically-modified hedgehog 
compositions, by treatment with a local infiision can prevent and/or reduce the severity 
of neurological conditions deriving from: (i) acute, subacute, or chronic injury to the 
nervous system, including traumatic injtiry, chemical injury, vessel injury, and deficits 
(such as the ischemia from stroke), together with infectious and tumor-induced injury; 

25 (ii) aging of the nervous system including Alzheimer's disease; (iii) chronic 
neurodegenerative diseases of the nervous system, including Parkinson's disease, 
Huntington's chorea, amylotrophic lateral sclerosis and the like; and (iv) chronic 
immimological diseases of the nervous system, including multiple sclerosis. The 
hydrophobically-modifed protein may also be injected into the cerebrospinal fluid, e.g., 

30 in order to address deficiencies of brain cells, or into the lymph system or blood stream 
as required to target other tissue or organ system-specific disorders. 

Hedgehog compositions of the invention may be used to rescue, for example, 
vanous neurons fix)m lesion-induced death as well as guiding reprojection of these 
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neurons after such damage. Such damage can be attributed to conditions that include, 
but are not limited to, CNS trauma infarction, infection, metabolic disease, nutritional 
deficiency, and toxic agents (such as cisplatin treatment). Certain hedgehog proteins 
cause neoplastic or hyperplastic transformed cells to become either post-mitotic or 
5 apoptotic. Such compositions may, therefore, be of use in the treatment of, for 
instance, malignant gliomas, medulloblastomas and neuroectodermal tumors. 

The proteins may also be linked to detectable markers, such as fluoroscopically 
or radiographically opaque substances, and administered to a subject to allow imaging 
of tissues which express hedgehog receptors. The proteins may also be bound to 

10 substances, such as horseradish peroxidase, which can be used as immunocytochemical 
stains to allow visualization of areas of hedgehog ligand-positive cells on histological 
sections. Hydrophobically-modified proteins of the invention, either alone or as 
multivalent protein complexes, can be used to specifically target medical therapies 
against cancers and tumors which express the receptor for the protein. Such materials 

15 can be made more effective as cancer therapeutics by using them as delivery vehicles 
for antineoplastic drugs, toxins, and cytocidal radionuclides, such as yttrium 90. 

A toxin may also be conjugated to hydrophobically-modified hedgehog (or 
vesicle-containing multivalent complexes thereof) to selectively target and kill 
hedgehog-responsive cells, such as a tumor expressing hedgehog receptor(s). Other 

20 toxins are equally useful, as known to those of skill in the art. Such toxins include, but 
are not limited to, Pseudomonas exotoxin. Diphtheria toxin, and saporin. This 
approach should prove successful because hedgehog receptor(s) are expressed in a very 
limited number of tissues. Another approach to such medical therapies is to use 
radioisotope labeled, hydrophobically-modified protein (or multivalent complexes 

25 thereof). Such radiolabeled compounds will preferentially target radioactivity to sites 
in cells expressing the protein receptor(s), sparing normal tissues. Depending on the 
radioisotope employed, the radiation emitted from a radiolabeled protein bound to a 
tumor cell may also kill nearby malignant tumor cells that do not express the protein 
receptor. A variety of radionuclides may be used. Radio-iodine (for example, ^^*I) has 

30 been successful when employed with monoclonal antibodies against CD20 present on 
B-cell lymphomas (63). 

The protein compositions to be used in therapy will be formulated and dosages 
established in a fashion consistent with good medical practice taking into account the 
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disorder to be treated, the condition of the individual patient, the site of delivery of the 
isolated polypeptide, the method of administration, and other factors known to 
practitioners. The therapeutic may be prepared for administration by mixing a protein, 
a protein-containing vesicle, or a derivatized complex at the desired degree of purity 
5 with physiologically acceptable carriers (i.e. carriers which are nontoxic to recipients at 
the dosages and concentrations employed). 

It is envisioned that local delivery to the site will be the primary route for 
therapeutic administration of the proteins of this invention. Intravenous dehvery, or 
delivery through catheter or other surgical tubing may also be envisioned. Alternative 
10 routes include tablets and the like, commercially available nebulizers for liquid 
formulations, and inhalation of lyophilized or aerosolized formulations. Liquid 
formulations may be utilized after reconstitution from powder formulations. 

The dose administered will be dependent upon the properties of the vesicle and 
protein employed, e.g. its binding activity and in vivo plasma half-life, the 

1 5 concentration of the vesicle and protein in the formulation, the administration route, the 
site and rate of dosage, the clinical tolerance of the patient involved, the pathological 
condition afflicting the patient and the like, as is well known within the skill of the 
physician. Generally, doses of from about 5 x 10" ' to 5 x 10"' Mdlar of protein per 
patient per administration are preferred, although the dosage will depend on the nature 

20 of the protem. Different dosages may be utilized during a series of sequential 
administrations. 

The invention is also directed towards a pharmaceutical formulation which 
includes a hedgehog protein modified according to the invention in combination with a 
pharmaceutically acceptable carrier. In one embodiment, the formulation also includes 
25 vesicles. 

The hydrophobically-modified hedgehog proteins of the invention are also 
useful in gene therapy methods. 

For neurodegenerative disorders, several animal models are available that are 
believed to have some clinical predicative value. For Parkinson's disease, models 
30 involve the protection, or the recovery in rodents or primates in which the nigral-striatal 
dopaminergic pathway is damaged either by the systemic administration of MPTP or 
the local (mtracranial) administration of 6-hydroxydopamine [6-OHDA], two selective 
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dopaminergic toxins. Specific models are: MPTP- treated mouse model (64); MPTP- 
treated primate (marmoset or Rhesus) model (65), and the unilateral 6-OHDA lesion rat 
model (66). For ALS. (Amyotrophic lateral sclerosis) models involve treatment of 
several mice strains that show spontaneous motor neuron degeneration, including the 
5 wobbler (67) and pmn mice (68), and of transgenic mice expressing the human mutated 
superoxidase dismutase (hSOD) gene that has been linked to familial ALS (69). For 
spinal cord iniurv. the most common models involve contusion injury to rats, either 
through a calibrated weight drop, or fluid (hydrodynamic) injury (70). For 
Huntington's, models involve protection from excitotoxin (NMDA, quinolinic acid, 

10 kainic acid, 3-nitro-propionic acid, APMA) lesion to the striatum in rats (71, 72). 
Recently, a model of transgenic mice overexpressing the human trinucleotide expanded 
repeat in the huntingtin gene has also been described (73). For multiple sclerosis. EAE 
in mice and rats is induced by immunization with MBP (myelin basic protein), or 
passive transfer of T cells activated with MBP (74), For Alzheimer's, a relevant murine 

1 5 model is a determination of protection against lesion of the fimbria-fornix in rats (septal 
lesion), the main nerve bxmdle supplying the cholinergic innervation of the 
hippocampus (75), as well as use of transgenic mice overexpressing the human beta- 
amyloid gene. For peripheral neuropathies, a relevant model is protection against loss 
of peripheral nerve conductance caused by chemtherapeutic agents such as taxol, 

20 vincristine, and cisplatin in mice and rats (76). 

TTiis mvention will now be described more fiilly with reference to the following, 
non-limiting, ^Examples. 

25 Example 1: Human Sonic Hedgehog is Lipid-Modificd when Expressed in Insect 
Cells 

A. Expression of Human Sonic Hedgehog 

The cDNA for full-length human Sonic hedgehog (Shh) was provided as a 1.6 
30 kb EcoRI fragment subcloned into pBluescript SK* (20) (a gift of David Bumcrot from 
Ontogeny, Inc., Cambridge MA). The 5' and 3' NotI sites immediately flanking the Shh 
open reading frame were added by unique site elimination mutagenesis using a 
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Phannacia kit following the manufacturer's recommended protocol. The 1.4 kb NotI 
fragment carrying the full-length Shh cDNA was then subcloned into the insect 
expression vector, pFastBac (Life Technologies, Inc.). Recombinant baculovirus was 
generated using the procedures supplied by Life Technologies, Inc. The resulting virus 
5 was used to create a high titer virus stock. Methods used for production and 
purification of Shh are described below. The presence of membrane-associated Shh was 
examined by FACS and by Western blot analysis. Peak expression occurred 48 h post- 
infection. For Westem blot analysis, supematants and cell lysates from Shh-infected or 
uninfected cells were subjected to SDS-PAGE on a 10-20% gradient gel under reducing 
1 0 conditions, transferred electrophoretically to nitrocellulose, and the Shh detected with a 
rabbit polyclonal antiserum raised against an N-terminal Shh 15-mer peptide-keyhole 
limpet hemocyanin conjugate. The cell lysates were made by incubating the cells for 5 
min at 25**C in 20 mM Na^HPO^ pH 6.5, 1% Nonidet P-40 and 150 mM NaCl or 20 
mM Tris-HCl pH 8.0, 50 mM NaCl, 0.5% Nonidet P.40 and 0.5% sodium 
15 deoxycholate and then pelleting particulates at 13,000 rpm for 10 min at 4^C in an 
Eppendorf centrifuge. 

B. Purification of Membrane-Tethered Human Sonic Hedgehog 

The membrane-tethered form of Shh was produced in High Five™ insect cells 
(Invitrogen) using die recombinant baculovirus encoding fiill-lengtii Shh discussed 
above. High Five™ cells were grown at 28*^0 in sf900 II serum free medium (Life 
Technologies, Inc.) in a 10 L bioreactor controlled for oxygen. The cells were infected 
in late log phase at ca. 2x10* cells/mL with virus at a MOI of 3 and harvested 48 h 
after infection (cell viability at the time of harvest was > 50%). The cells were 
collected by centrifugation and washed in 10 mM Na2HP04 pH 6.5. 150 mM NaCl pH 
plus 0.5 mM PMSF. The resulting cell pellet (150 g wet weight) was suspended in 1.2 
L of 10 mM NajHP04 pH 6.5, 150 mM NaCl, 0.5 mM PMSF, 5 jiM pepstatin A, 10 
jig/mL leupeptin, and 2 ^ig/mL E64, and 120 mL of a 10% solution of Triton X-100 
was then added. 

After a 30 min incubation on ice, particulates were removed by centrifugation 
(1500 X g, 10 min). All subsequent steps were performed at 4-6**C. The pH of the 
supernatant was adjusted to 5.0 witii a stock solution of 0.5 M MES pH 5.0 (50 mM 
final) and loaded onto a 150 ml SP-Sepharose Fast Flow column (Pharmacia). The 
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column was washed with 300 mL of 5 mM NajHPO^ pH 5.5, 150 mM NaCI, 0.5 mM 
PMSF, 0.1% Nonidet P-40, then with 200 mL of 5 mM NajHP04 pH 5.5, 300 mM 
NaCl, 0.1% Nonidet P-40, and bound hedgehog eluted with 5 mM Na^HPO^ pH 5.5, 
800 mM NaCI, 0.1% Nonidet P-40. 

5 The Shh was next subjected to immunoaffinity chromatography on 5E1- 

Sepharose resin that was prepared by conjugating 4 mg of antibody per mL of CNBr 
activated Sepharose 4B resin. The SP-Sepharose elution pool was diluted with two 
volumes of 50 mM HEPES pH 7.5 and batch loaded onto the 5E1 resin (1 h). The resin 
was collected in a column, washed with 10 column volumes of PBS containing 0.1% 
10 hydrogenated Triton X-100 (Calbiochem), and eluted with 25 mM NaH^PO^ pH 3.0, 
200 mM NaCl, 0.1% hydrogenated Triton X-100. The elution fractions were 
neutralized with 0.1 volume of IM HEPES pH 7.5 and analyzed for total protein 
content from absorbance measurements at 240-340 nm and for purity by SDS-PAGE. 
Fractions were stored at -70**C. 

15 Peak fractions from three affinity steps were pooled, diluted with 1.3 volumes 

of 50 mM HEPES pH 7.5, 0.2% hydrogenated Triton X-100 and again batch loaded 
onto the 5E1 resin. The resin was collected in a column, washed widi three colunm 
volumes of PBS pH 7.2, 1% octylglucoside (US Biochemical Corp.), and eluted with 
25 mM NaH2P04 pH 3.0, 200 mM NaCl, 1% octylglucoside. The elution fractions 

20 were neutralized and analyzed as described above, pooled, filtered through a 0.2 micron 
filter, aliquoted, and stored at -70**C. 

When fiill-length human sonic hedgehog (Shh) was expressed in High Five™ 
insect cells, over 95% of the N-terminal Augment was detected by Western blotting in a 
form that was cell- associated. By SDS-PAGE, the purified protein migrated as a 

25 single sharp jband with apparent mass of 20 kDa (Figxirel, lane c). The protein 
migrated faster by about 0.5 kDa than a soluble version of the protein that had been 
produced in Kcoli (Figure 1, lanes b-d), consistent with data published previously (19). 
Similarly as described (19), the soluble and membrane-bound Shh proteins were also 
readily distmguishable by reverse phase HPLC where the tethered form eluted later in 

30 the acetonitrile gradient. The concentration of acetonitrile needed for elution of the 
membrane-bound form was 60% versus only 45% with the soluble fomv, indicating a 
significant increase in the hydrophobicity of the protein. 
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C. Mass Spectrometry Analysis of Membrane-Tethered Human Sonic Hedgehog 

Aliquots of Shh were subjected to reverse phase HPLC on a column (Vydac, 
Cat. No. 214TP104, column dimensions 4.6 mm internal diameter x 250 mm) at 
ambient temperature. Bound components were eluted with a 30 min 0-80% gradient of 
5 acetonitrile in 0.1% trifluoroacetic acid at a flow rate of 1,4 mL/min. The column 
effluent was monitored at 280 nm and 0.5 min fractions were collected. 25 p,L aliquots 
of fractions containing protein were dried in a Speed Vac concentrator, dissolved in 
electrophoresis sample buffer, and analyzed by SDS-PAGE. Hedgehog-containing 
fractions were pooled, concentrated 4-fold in a Speed Vac concentrator and the protein 

1 0 content assayed by absorbance at 280 nm using an extinction coefficient of 1 .33 for a 1 
mg/mL solution of Shh. Samples were subjected to ESI-MS on a Micromass Quattro II 
triple quadrupole mass spectrometer, equipped with an electrospray ion source. A 
volume of 6 ^iL of HPLC-purified hedgehog was infused directly into the ion source at 
a rate of 10 ^L/min using 50% water, 50% acetonitrile, 0.1% formic acid as the solvent 

15 in the syringe pump. Scans were acquired throughout the sample infusion. All 
electrospray mass spectral data were acquired and stored in profile mode and were 
processed using the Micromass MassLynx data system. 

Peptides from an endoproteinase Lys-C digest of pyridylethylated-Shh were 
analyzed by reverse phase HPLC in line with the Micromass Quattro II triple 
20 quadrupole mass spectrometer. The digest was separated on a Reliasil C,8 column 
using a Michrom™ ultrafast Microprotein Analyzer system at a flow rate of 50 ^iL/min 
v^th a 5-85% acetonitrile gradient m 0.05% trifluoroacetic acid. Scans were acquired 
from m/z 400-2000 throughout the run and processed as described above. 

Sequencing of the N-terminal peptide from tethered Shh was performed by Post 
25 Source Decay (PSD)-measurement on a Voyager-DE™ STR (PerSeptive Biosystems. 
Framingham, MA) time-of-flight (TOP) mass spectrometer using a-cyano-4- 
hydroxycinnamic acid as the matrix (22,23). Exactly 0.5 |iL of HPLC-purified 
endoproteinase Lys-C peptide was mixed with 0.5 \iL of matrix on the target plate. To 
cover the entire spectrum of fragment ions, the mirror voltages were decreased from 20 
30 to 1:2 kv in 11 steps. 

Electrospray ionization mass spectrometry data for the soluble and membrane- 
bound forms of Shh showed primary species with masses of 19560 and 20167 Da, 
respectively (Figure 2). The measured mass of 19560 Da matches the predicted mass 
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for Shh starting with Cys-1 and terminating with Gly-174 (calc. mass of 19560.02 Da). 
In contrast, the 20167 Da mass did not agree with any available prediction nor could the 
difference in the masses of the tethered and soluble forms, 607 Da, be accounted for by 
any known modification or by aberrant proteolytic processing. Previously, Porter ct al. 
5 (18) demonstrated that Drosophila hedgehog contained a cholesterol moiety and thus it 
was possible that the mass difference in the human system was due, at least in part, to 
cholesterol (calculated mass for esterified cholesterol is 368.65 Da). The presence of a 
minor component in the mass spectrum of tethered Shh at 1 9796 Da, which differs from 
the primary peak by 371 Da, supported this notion. 

10 Further evidence for cholesterol was obtained by treating the tethered Shh with 

a mild alkali under conditions that can break the cholesterol linkage without disrupting 
peptide bonds (18), and then reanalyzing the reaction products by mass spectrometry' 
(MS). Briefly, insect cell-derived Shh was treated with 50 mM KOH, 95% methanol 
for 1 h at ambient temperature and then analyzed by ESI-MS or digested with 

15 endoproteinase Lys-C and subjected to LC (liquid chn)matography)-MS on the 
Micromass Quattro II triple quadrupole mass spectrometer. For samples subjected to 
LC-MS, the proteins were first treated with 4-vinylpyridine. Base treatment shifted the 
mass by 387 Da, which is consistent with the loss of cholesterol plus water (see Table 
3). The mass of soluble Shh was not affected by base treatment. Together, these 

20 observations suggested that the membrane-tethered human Shh contained two 
modifications, a cholesterol and a second moiety with a mass of 236 Da. The similarity 
in mass between this value and the mass of an added pahnitoyl group (238 Da) 
suggested that the protein might be palmitoylated. More accurate estimates of the 
mass, discussed below, revealed a correlation within 0.1 Da of the predicted mass of a 

25 palmitoyi moiety. 



30 



TABLE 3. Characterization of tethered Shh by MS. Calculated mass values were 
determined using average residue masses in part a and monoisotopic protonated masses 
in part b. 

Protein Mass (Da) 

Calculated ' Measured 

a. KOH-treated Shh 

no tether (-treatment) 19560.02 19560 
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no tether 

tethered 

tethered 



(+treatment) 
("treatment) 
(^treatment) 



19560.02 
20167,14 
19798.49 



19561 
20167 
19780 



b. 



N-tenminal endoproteinase Lys-C peptide (MH^* 
no tether 983.49 
tethered ' 1221.72 



983.50 
122L79 



* All mass values for peptides described herein are protonated masses 
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Subsequently, we determined that tethered Shh could be fractionated into 
subspecies by HPLC with a modified elution gradient and we developed a simple 
HPLC assay for quantifying the various forms. Results from these analyses are shown 
in Figure 3. In this assay, the unmodified Shh elules first (peak 1), then cholesterol- 
modified Shh elutes (peak 2), and finally product containing both cholesterol and 
palmitic acid-modified Shh elutes (peak 3). The complex shape of peak 3 reflects the 
presence of a modified form of the palmitoyl group that was identified through 
sequencing by MALDI PSD measurement. The variant was 2 Da smaller than 
predicted and may therefore contain an unsaturated bond (data not shown). 

Localization of the Palmitic Acid Modification Within the Human Sonic 
Hedgehog Sequence 

The site of palmitoylation within the human sequence was identified using a 
combination of peptide mappmg and sequence analysis. Figure 4B shows results from 
a peptide mapping analysis of the soluble protem wdth an LC-MS readout. Mass data 
accounting for over 98% of the soluble Shh sequence could be accounted for from the 
analysis. The peak noted vwth an asterisk corresponds to the N-terminal peptide 
(residues 1-9 plus 4-vinylpyridme, observed mass 983.50 Da, calculated mass 983.49 
Da; Table 3). In the corresponding analysis of the tethered product (Figure 4A), this 
peptide was missing and instead a more hydrophobic peptide with mass of 1221.79 Da 
was observed (noted with asterisk). 
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The 1221.79 Da moiety is consistent with the presence of a modified form of 
the N-terminal peptide, i.e. 983.49 Da for the peptide component plus 238.23 Da. The 
1221.79 Da peptide was ne?ct subjected to sequence analysis by MALDI PSD 
measurement. The resulting PSD spectrum is shown in Figure 5. Ions corresponding 
5 to bl, b2, b4, b5, b8 + H2O, yS, y7, y5, y4, y3, y2, and yl fi-agments were detected 
which confirmed the sequence. In addition, the bl and b2 ions indicated that the 
pyridylethylated Cys-1 adduct was palmitoylated. Only ions containing Cys-1, 
contained the added 238.23 Da mass. 

Since cysteine is a normal site of palmitoylation for proteins in v/vo, it was not 
10 surprising to find the novel adduct attached to the N-terminal cysteine. However, two 
pieces of evidence suggested that the lipid was attached to the amino group on the 
cysteme and not the thiol. First, in the peptide mapping study, we used 4-vinylpyridine 
as a spectroscopic tag to monitor free thiol groups (27). Pyridylelhylation is highly 
specific for cysteine thiols and adds a 105 Da adduct that can be detected by MS. The 
15 observed Cys-1 -containing Segments in the PSD spectrum contained both palmitoyl 
and pyridylethyl modifications, implying the presence of a fipee thiol group. Second, 
the tethered Shh was subjected to automated N-terminal Edman sequencing and no 
sequence was obtained, suggesting blockage at the N-terminal a-amine. By contrast, 
the corresponding soluble form of Shh can be sequenced readily, 

20 

Example 2: Human Sonic Hedgehog can be Modified with Palmitic Acid in a Cell- 
Free System 

Soluble Shh was labeled with ^H-palmitic acid in a cell-firee system using a 
modified version of a published procedure (24). A crude microsomal firaction from rat 

25 liver was prepared by subjecting a liver homogenate to sequential centrifugation at 
3000 X g for 10 min, 9000 x g for 20 min, and 100,000 x g for 30 min. The 100,000 x 
g pellet was suspended in 10 niM HEPES pH 7.4, 10% sucrose and again centrifuged at 
100,000 X g fdx 20 min. The final pellet (derived fiom 10 g of liver) was suspended in 
3 mL of 20 mM Tris-HCl pH 7.4, 150 mM NaCl, 1 mM EDTA, 10 ^ig/mL leupeptin, 

30 0. 1 5% Triton X- 1 00, aliquoted, and stored at -70^C. Reactions containing 3 ^ig Shh, 1 
^iL of rat microsomes, 50 ng/mL Coenzyme A (Sigma), 0.3 mM ATP; 20 mM Tris-HCl 
pH 7.4, 150 mM NaCl, 1 mM EDTA, 10 ^g/mL leupeptin, and 0.5 MCi-[9,10-^H]- 
palmitic acid (50 Ci/mmol; New England Nuclear) were performed at room 
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temperature for 1 h. Reactions were stopped with reducing electrophoresis sample 
buffer, subjected to SDS-PAGE on a 10-20% gradient gel, and visualized by 
fluorography. 

As shown in Figure 1 (lane e), Shh is readily labeled with the radioactive tracer. 
5 None of the fa hundred other proteins in the reaction mixture were labeled (see the 
corresponding Coomassie blue-stained gel profile in lane j), indicating a high degree of 
specificity of the palmitoylation reaction. As further evidence for the specificity of the 
palmitoylation reaction, we tested two Shh variants in which die site of palmitoylation 
had been eliminated. Figure 1 (lane f) shows results from the analysis of a truncated 
10 form of soluble Shh that was lacking the first 10 amino acid residues of tiie mature 
sequence) and lane g, of a mutant form of Shh containing, at its N-terminal end, a 
single Cys-1 to Ser point mutation. Neither of the variants were labeled. 

The significance of the N-terminal cysteine as the site of lipid derivatization is 
highlighted by tiie fact tiiat vwld type soluble Shh is readily labeled while die N- 
15 terminal cysteine to serine mutant is not. The inability to label the N-terminal serine 
mutant argues against a simple reaction mechanism where the pahnitoyl moiety is 
directiy attached to the N-terminal a-amine since under the test conditions the serine 
should have substituted for the cysteine. 

We also tested the role of the fi-ee N-terminus using a form of soluble Shh with 
20 an N-terminal histidine (His)-tag extension. The soluble human Shh used in these 
studies had b|en produced mitially as a His-tagged fusion protem with an enterokinase 
cleavage site at the junction of the mature sequence and was then processed with 
enterokinase to remove the His tag. The His-tagged Shh was not palmitoylated despite 
the presence of the fi*ee thiol group of the cysteine (See Figure 1, lane i). While we 
25 cannot rule out the possibility that die N-terminal extension sterically inhibits 
pahnitoylation from occurring, Cys-1 is at the PI ' position of the enterokinase cleavage 
site and is readily accessible to enzymatic processing. Thus it appears that both the 
thiol and a-amine of Cys-1 contribute to the palmitoylation reaction. Since all known 
palmitoylation reactions target die side chains of Cys, Ser, or Thr residues, we infer tiiat 
30 the modification on hedgehog starts with tiie formation of a thioester intermediate, and 
tiiat die pahnitoyl moiety is then transferred to the N-termmus dirough the formation of 
a cyclic intermediate. This hypothesis was confirmed during studies of the 
modification of human Sonic hedgehog using palmityol Coenzyme A (See Example 8). 
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Example 3: Demonstration of Increased Potency of Naturally Ocurring Fatty- 
Acylated Human Sonic Hedgehog in a Cell-Based (C3H10T1/2) Assay 

5 Shh was tested for function in a cell-based assay measuring alkaline 

phosphatase induction in C3H10T1/2 cells (25) with a 5 day readout. The assay was 
preformed in a 96-well format. Samples were run in duplicate. For tethered Shh (100 
fig/mL), the samples were first diluted 200-fold with normal growth medium then 
subjected to serial 2-fold dilutions down the plates. Wells were normalized for 
1 0 potential effects of the added octylglucoside by including 0.005% octylglucoside in the 
culture medium. Blocking studies using the neutralizing murine mAb 5E1 (26) were 
performed by mixing Shh with serial dilutions of the antibody for 30 min at ambient 
temperature in culture medium prior to adding the test samples to the plates. 

In this assay, soluble human Shh produces a dose-dependent response with an 
15 IC50 of 1 jig/mL and a maximal signal at 3 fig/mL (Figure 6A). Tethered human Shh, 
with a cholesterol attached at the C-terminus and a palmitoyl group at the N-terminus, 
similarly produced a dose-dependent response in the assay but with an IC50 of 0.03 
jig/mL and a maximal signal at 0.1 i^g/mL, indicating that it was about 30 times as 
potent as soluble Shh. To verify that the observed activity was hedgehog specific, we 
20 tested whether the activity could be inhibited with the anti-hedgehog neutralizing mAb 
5E1. Both soluble and tethered Shh were inhibited by 5E1 treatment (Figure 6B). 
Inhibition of the tethered Shh required a tenth as much 5E1 consistent with its increased 
activity in the assay. 

Tethered Shh was tested m a receptor binding assay, monitoring its ability to 
25 hindpatched, using a modified version of a recently published assay (10). The tethered 
Shh showed dose-dependent binding to cells expressing patched mth an apparent IC50 
of 400 ng/mL (Figure 7). In the same assay, soluble Shh bound to patched with an 
apparent IC^ of 150 ng/mL, indicatmg that the tethered fomi bound only slightly less 
tightly to its receptor. 

30 

Example 4; Analysis ofTethered Human Sonic Hedgeh g after Reconstitution into 
Liposomes 
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This example illustrates that reconstitution experiments into positively and 
negatively charged liposomes by detergent dilution over a wide range of lipid : protein 
ratios (w/w) from 1 ; 1 to 100 : 1 had no effect on tethered Shh activity in the 
C3H10T1/2 assay, 

5 Reconstitution into phospholipid-containing liposomes provides a useful 

formulation for lipid-containing proteins because it allows a lipid-contaming protein to 
exist in a near normal setting. To test whether such a formulation was viable for 
tethered Shh we utilized a detergent dilution method to incorporate the protein mto a 
liposome (60), where preformed liposomes are mixed with octylglucoside and the 

10 protein of interest, and then the detergent is diluted below its critical micelle 
concentration, thus driving the reconstitution. While any of a large number of pure or 
lipid mixtures can be utilized, we selected two commercially available mixtures as 
models; a negatively charged liposome kit containing egg L-a-phosphatidylcholine, 
dicetyl phosphate, and cholesterol (Cat. No. L-4262; Sigma, St. Louis, MO), and a 

15 positively charged liposome kit consisting of egg phosphatidyl choline, stearlyamine, 
and cholesterol (Cat. No L-4137, Sigma). 

Briefly, the lipids were transferred mto a Pyrex tube, dried under a stream of 
nitrogen, and residual solvent removed by lyophilization. The lipid was suspended in 
10 mM HEPES pH 7.5, 100 mM NaCl, 2.0% octylglucoside, vortexed, and sonicated 
20 until the suspension had turned opalescent in appearance. The lipid was then filtered 
through a 0.2 micron filter. Aliquots of tethered Shh. firom baculovirus-infected High 
Five"^ insect cells, in octylglucoside were treated with a 400-, 1000-, 5000-, and a 
20000-fold excess of lipid (w/w) and after a 15 min preincubation the samples were 
diluted and assayed for activity in the C3H10T1/2 assay. 

V 

25 Neither the positive nor the negative liposome treatment had any affect on the 

activity of the hedgehog indicating that a lipid carrier was a viable fomulation. To 
verify that the hedgehog indeed had become reconstituted, parallel samples were 
subjected to centrifugation under conditions where the tethered Shh would normally 
pellet and the liposomes would float to the surface of the sample. Under these 

30 conditions the tethered Shh floated to the surface, indicating that reconstitution had 
occurred. 
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Example 5: Characterization of Membrane-Tethered Human Sonic Hedgehog 
from Mammalian (EBNA-293) Cells 

In order to assess whether palniitoylation was a general modification pathway 
for Sonic hedgehog or whether it was specific to insect cell production, the protein was 
5 also produced in a mammalian system in EBNA-293 cells. For expression of full- 
length Shh in manunalian cells, the 1.4 kb NotI fragment containing full-length Shh 
(See Example 1) was cloned into a derivative of the vector, CH269 pCEP4 (Invitrogen, 
San Diego, CA (21)). The construct was transfccted into EBNA-293 cells using 
lipofectamine (Life Technologies, Inc.) and the cells were harvested 48 h post- 
10 transfection. The expression of surface Shh was verified by FACS and by Western blot 
analysis. 

Tethered Shh from EBNA-293 cells was fractionated by reverse phase HPLC 
on a narrow bore C4 column (See Figure 3). Peaks were analyzed by ESI-MS (parts a 
and b of Table 4) or by MALDI-TOF MS on a Finnigan LaserMat mass spectrometer 

1 5 using a-cyano-4-hydroxycinnamic acid as the matrix (part c of Table 4). By SDS- 
PAGE, the protein migrated slightly faster than soluble Shh, it was retarded on the C4 
column in the reverse phase HPLC analysis, and, by mass spectrometry, it contained an 
ion corresponding to the palmitic acid plus cholesterol modification. However, unlike 
the insect cell-derived product where over 80% of the product contained both the 

20 palmitic acid and cholesterol modification, the HPLC elution profile and data from 
mass spectrometry revealed that most of the mammalian cell-derived protein lacked the 
palmitoyl moiety (see Table 4 and Figure 3C). That is, in peak 2 from EBNA-293 cells 
the ratio of clipped (des-1-10) versus intact protein by MS signal was 50% whereas for 
peak 1 only about 10% of the Shh was clipped. Interestingly, both the insect cell and 

25 manMnalian cell-derived products showed comparable activity in the C3H1 OTl/2 assay 
suggesting that both the cholesterol, and the cholesterol plus palmitic acid 
modifications are functional. Whether the second lipid attachment site is used simply 
to further stabilize the association of the protein with membrane or whether it plays a 
more active role and affects its conformation or protein-protein contacts remains to be 

30 determined. 



Fatty acid derivatization of proteins is a common post translational modification 
that occurs late in the maturation processes (28,29). For cysteine derivatives, the 
process is dynamic involving separate enzymes that add and remove the modification 
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on the sulfhydryl group. The most common functions of such derivatization (e.g., 
palmitoylation) are to alter the physico-chemical properties of the protein, i.e., to target 
a protein to its site of function, to promote protein-protein interactions, and to mediate 
protein-membrane interactions (30). For hedgehog, while the difference in the extent of 
5 palmitoylation in the insect and manwnalian cell-derived preparations (80% in insect 
cells versus 30% in mammalian cells) was surprising, we do not know if it is 
biologically significant or whether it simply reflects differences in the cellular 
machinery of the two test systems for adding and removing palmitic acid. The 
difference in the extent of modification in the insect and manmxalian cells is unlikely to 
10 be species related since tethered Drosophila hedgehog that was produced in insect cells 
lacked palmitic acid (19) despite having the identical N-terminal sequence. 



15 



20 



25 



30 



TABLE 4. Mass spectrometry analysis of EBNA-293 -derived tethered human Sonic 
hedgehog. 



Protem 

a. bacterial expressed (no tether) 

b. baculovirus expressed (tethered) 

+ palmitic acid 

+ palmitic acid/cholesterol 

c. EBNA.293 cell expressed (tethered) 

peak 1 (9% of total hedgehog) 
no tether 

no tether (des 1-9) 
peak 2 (61% of total hedgehog) 
+ cholesterol 
+ cholesterol (des 1-10) 
peak 3 (30% of total hedgehog) 
+ palmitoyl/cholesterol 



Mass (Da) 



Calculated 
19560.02 



19798.49 
20167,14 



19560.02 
18700.02 

19928.67 
18912.48 

20167.14 



Measured 
19560 



19796 
20168 



19581 
18712 

19934 
18889 

20174 



35 



Example 6: Lipid Modifications of Rat Sonic Hedgehog 
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This Example illustrates that a variety of lipids become linked to a soluble 
version of rat Sonic hedgehog when the rat Shh gene encoding residues 1-174 is 
expressed in High Five™ insect cells, essentially as for the full-length human Shh 
described in Example 1. The lipid modification renders this fraction membrane- 
5 associated. The N-terminal fragment (residues 1-174 of unprocessed rat Sonic 
hedgehog) differs by only 2 amino acid residues to that of the N-terminal fragment of 
human Sonic 'hedgehog. In the rat Sonic hedgehog N-terminal fragment, threonine 
replaces serine at position 44, and aspartic acid replaces glycine at position 173. When 
rat Sonic hedgehog lacking the autoprocessing domain is expressed in the High-Five™ 

10 insect cell/baculovirus expression system, the majority of the protein is secreted into the 
culture medium since it lacks the ability to attach a cholesterol moiety to the C- 
terminus. This soluble form has a specific biological activity (measured by the 
C3H10T1/2 alkaline phosphatase induction assay of Example 3) that was similar to that 
of the soluble, N-terminal fragment of human Sonic hedgehog expressed and purified 

1 5 from £ colL 

However, a small fraction of the total protein remains associated with the insect 
cells. The cell-associated rat Sonic hedgehog protein was purified essentially as 
described in Example 1, and was found to be significantly more active in the alkaline 
phosphatase assay (data not presented) than the soluble, N-terminal fragments of either 

20 human or rat Sonic hedgehog purified from E. coli and the High-Five™ insect 
celLTjaculovirus expression system, respectively. Subsequent analyses of the rat Sonic 
hedgehog N-terminal fragments by HPLC and electrospray mass spectrometry (as 
described in Example 1) suggests that the protein is lipid-modified and that there was 
more than one type of lipid modification. Supporting evidence includes the following 

25 observations: 

1. The cell-associated forms elute later than the soluble, N-terminal fragments 
of human and rat sonic hedgehog from a C4 reverse phase HPLC column (Vydac 
catalog number 214TP104) developed with a linear 30 min 0-70% acetonitrile gradient 
in 0.1% trifluoroacetic acid; 

30 2. The masses of the cell-associated forms are consistent with that expected for 

the lipid-modified proteins, as shown in Table 5. 
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TABLE 5: Masses of various lipid-modified fonns of rat Sonic hedgehog. 



Protein Adduct Expected Mass*(MH''^ Observed Mass 

5 unmodified , none 19,632.08 19,632 

myristoy). CH3(CHj),2CO- 19,842.50 19,841 

palmitoyl- CH3(CH2)|4C0- 19,870.55 19,868 

stearol- CH5(CH,)„C0- 19,898.60 19,896 

arachidoyl- CH3(CH2),3CO- 1 9,926.66 1 9,925 



10 ♦ Average masses were used in calculating the expected masses 

The location of the lipid moiety was determined using a combination of 
sequence analysis and peptide mapping. Automated N-terminal Edman sequencing of 
the lipid-modified forms indicated that the N-terminus was blocked, suggesting that the 
15 lipid was attached to the a-amine of the N-terminal cysteme. Endo-Lys-C peptide 
mapping, MALDI-TOF mass spectrometry and MALDI PSD analysis (as described in 
Example 1) of the 4-vinylpyridine alkylated lipid-modified forms, were used to confirm 
the location of the lipid modifications and to determine their exact masses. 

The masses of the N-terminal peptides (residues 1-9 inclusive plus 4- 
20 vinylpyridme attached to the thiol side chain of the N-terminal cysteine) carrying the 
lipid modifications were consistent within 0.1 Da with that expected for the lipid- 
modified peptides as shown in Table 6. 

TABLE 6: Masses of the N-terminal peptides iosolated from various lipid-modified 
25 fomis of rat Sonic hedgehog. 

Protein Adduct Expected Mass* (MH^) Observed Mass 

(MH^ 

myristoyl- CH3(CH2),2CO- 1 193,69 1 193.76 

30 palmitoyl- CH3(CH2)mCO- 1221.72 1221.65 

stearoyl- CH3(CH2),,CO- 1249.75 1249.71 

* Monoisotopic masses were used in calculating the expected masses 

In addition to the lipid-modified peptides shown in Table 6, peptides with 
35 masses of 1191.74, 1219.84 and 1247.82 were also detected. These masses are 
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consistent with unsaturated forms of myristate, palmitate and stearate, respectively, 
although the position of the double bond in the alkyl chain was not determined. These 
observations indicate that both saturated and unsaturated fatty acids can be attached 
covalently to the N-terminal cysteine. For both the saturated and unsaturated Hpid- 
5 modified peptides, MALDI PSD analysis as described in Example 1 confirmed that the 
lipids were attached covalently to the N-terminal cysteine residue. 

Example 7: Lipid Modification of Indian Hedgehog 

To assess whether the palmitoylation reaction was unique to human Shh or 
10 whether it might occur on other hedgehog proteins, we tested whether human Indian 
hedgehog (expressed in E. coli as a His-tagged fiision protein with an enterokinase 
cleavage site immediately adjacent to the start of the mature sequence, and purified 
exactly as foir recombinant human Sonic hedgehog (See Example 9)) could be 
palmitoylated using the assay described in Example 2. Human Indian hedgehog was 
1 5 modified (See Figure 1 , lane h), indicatmg that palmitoylation is likely to be a common 
feature of hedgehog proteins. The ability to directly label Shh and Ihh with radioactive 
pahnitic acid in a celi-fi-ee system provided a simple screen for amino acids involved in 
the modification reaction. Moreover. Indian hedgehog palmitoylated by the method 
described in Example 8 was significantly more potent in the C3H10T1/2 assay than the 
20 unmodified Ihh. 

Example 8: Lipid Modifications of Sonic Hedgehog using Acyl-Coenzyme A 

The in vitro acylation of a protein containing an N-terminal cysteine can be 
accomplished via a two-step, chemical reaction with a fatty acid-thioester donor. In the 

25 first step, the acyl group of the thioester donor transfers to the sulfliydryl of the N- 
terminal cysteine on the protein by a spontaneous transesterification reaction. 
Subsequently, the acyl moiety undergoes a S to N shift to the a-amine of the N- 
terminal cysteine to form a stable amide bond. Direct acylation of an amine function on 
a protein may ilso occur with prolonged incubation with a thioester, but the presence of 

30 a cysteine on the protein will accelerate the reaction and allow control over the 
acylation site. In the present examples, conmiercially available Coen2yme A 
derivatives (Sigma Chemical Company, St. Louis MO) are utilized, but other thioester 
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groups would also achieve the same result. In fact, certain thioester leaving groups, 
such as thiobenzyl esters, would be expected to react more rapidly. Internal cysteine 
residues may also promote acylation to neighboring lysmes (i.e., as in an internal 
cysteine-lysine pair) and this can be conveniently tested using synthetic peptides. 
5 Secondary acylations occuring on a protein during reaction with thioesters may be 
prevented by controlling the buffer composition, pH, or by site-directed mutagenesis of 
the neighboring lysines. 

In preliminary analysis of the effect of acylation on the ability of human Sonic 
hedgehog to induce alkaline phosphatase in C3H10T1/2 cells, reaction mixtures 

10 contained 1 mg/mL human Sonic hedgehog (51 jiM), 500 \iM of the particular, 
commerciaUy available, acyl-Coenzyme A (compounds tested included acetyl-CoA 
(C2:0), butyrjrl-CoA (C4:0), hexanoyl-CoA (C6:0), octanoyl-CoA (C8:0), decanoyl- 
CoA (C10:0),' lauroyl-CoA {C12:0), myristoyl-CoA (C14:0), palmitoyl-CoA (C16:0), 
pahnitoleoyl-CoA (C16:l), stearoyl-CoA (C18:0), arachidoyl-CoA (C20:0), behenoyl- 

15 CoA (C22:0), lignoceroyl-CoA (C24:0), succinyl-CoA, and benzoyl-CoA), 25 mM 
DTT, and 50 mM Na2HP04 pH 7.0. The reactions were incubated at room temperature 
for 3 h and then analyzed immediately (without purification) for bioactivity in the 
C3H10T1/2 assay as described in Example 3. Samples for analysis by reverse phase 
HPLC and other physical methods were usually stored at -70*C. HPLC analysis was 

20 carried out on a Vydac C4 reverse phase column (4.6 mm internal diameter x 250 mm, 5 
micron particle) with a 40 min gradient of 5% acetonitrile to 85% acetonitrile in 
aqueous 0.1% TFA, at a flow rate of 1 mL/min. The effluent was monitored at 280 nm, 
and fiactions were collected in some experiments and analyzed for hedgehog protein on 
SDS-PAGE with detection by Coomassie staining and by Western blotting. 

25 Comparison of the activity of the various reaction mixtures (Figure 10) indicates 

that a chain length of between 12 and 18 carbons is optimal in inducing high alkaline 
phosphatase activity as compared to the unmodified protein. Increasing the chain 
length fiirther resulted in an apparent reduction in activity, and the presence of a double 
bond in the unsaturated palmitoleoyl-CoA (CI 6:1) gave the same activity as the fiilly 

30 saturated palmitoyl-CoA (C16:0). Upon reverse phase HPLC analysis of the reaction 
mixtures, we observed that many of the shorter chain length acyl-CoA derivatives had 
not reacted with the hedgehog protein, and therefore the dependence of biological 
activity shown in Figure 10 was not a true reflection of the acyl chain length. 
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In order to obtain data on the true activity of the modified proteins, and on the 
dependence of activity on acyl chain length, we developed methods for the synthesis 
and purification of the individual N-terminal acylated forms. Palmitoylated, 
myristoylated, lauroylated, decanoylated, and octanoylated human Sonic hedgehog 
5 proteins, carrying a single acyl chain attached to the a-amine of the N-terminal 
cysteine, were produced in reaction mixtures containing 0,80 mg/raL (41 ^M) human 
Sonic hedgehog, 410 |iM (10-fold Molar excess) of either palmitoyl-CoA, myristoyl- 
CoA, or lauroyl-CoA, or 4.1 mM (100-fold Molar excess) of either decanoyl-CoA or 
octanoyl-CoA, 25 mM DTT (for reaction mixtures containirig pahnitoyl-CoA, 

10 myrisloyl-CoA, or lauroyl-CoA) or 0,5 mM DTT (for reaction mixtures containing 
decanoyl-CoA or octanoyl-CoA) , and 40 mM Na2HP04 pH 7.0. Reaction mixtures 
were incubated at 28°C for 24 h. Reaction of the N-terminal cysteine with the acyl 
thioesters results in the transfer of the acyl group to the sulfhydryl by a spontaneous 
transesterification reaction, which is followed by a S to N shift to the a-amine to form a 

15 stable amide linkage. The free sulfhydryl then undergoes a second transesterification 
reaction, yielding a protein with a fatty acyl group attached via a thioester linkage to the 
sulfhydryl. The thioester-linked acyl group was removed by adding consecutive 0.11 
volume of 1 M Na2HP04 pH 9.0, and 0,1 1 volume of 1 M hydroxylamine (0.1 M final 
concentration) followed by incubation at 28°C for 18 h, which leaves only the acyl 

20 amide attached to the protein (62). 0.25 volume of 5% octylglucoside was then added 
(1% final concentration) and the mixture mcubated for 1 h at room temperature. The 
proteins were then purified in the presence of 1% octylglucoside using SP-Sepharose 
Fast Flow '(Pharmacia) and Bio Scale S (Biorad) cationic ion exchange 
chromatographies. The purified proteins were dialyzed against 5 mM Na2HP04 pH 5.5, 

25 150 mM NaCl, 1% octylglucoside, 0.5 mM DTT, and were stored at -70°C. The 
presence of octylglucoside was required to maintain fiill solubility; removal of the 
detergent by dilution and dialysis resulted in a 75%, 41%, and 15% loss of the 
palmitoylated, myristoylated, and lauroylated proteins, respectively. ESI-MS of the 
HPLC-purified protems confirmed their integrity : palmitoylated Sonic hedgehog, 

30 measured mass = 19798, calculated mass = 19798.43; myristoylated Sonic hedgehog, 
measured mass = 19770, calculated mass = 19770.33; lauroylated Sonic hedgehog, 
measured mass = 19742, calculated niass = 19742.33; decanoylated Sonic hedgehog, 
measured mass = 19715, calculated mass = 19714.28; octanoylated Sonic hedgehog, 
measured mass = 19686, calculated mass = 1 9686.23. 
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Analysis of the various acylated forms of human Sonic hedgehog in the 
C3H10T1/2 assay (Figure 1 1) indicated that the activity of the proteins was dependent 
upon the chain length. The palmitoylated, myristoylated, and lauroylated proteins 
showed approximately equal activity with EC50 values of 5-10 ng/mL (100-200-fold 
5 increase in potency as compared to the unmodified protein). Decanoylated human 
Sonic hedgehog, with an EC50 value of 60-70 ng/mL (15-30-fold increase in potency as 
compared to the unmodified protein), was less active than the palmitoylated, 
myristoylated, and lauroylated forms, while the octanoylated form was the least active 
with an EC50 of 100-200 ng/mL (10-fold increase in potency as compared to the 
10 unmodified protein). All of the acylated forms were more potent than the unmodified 
protein which had an EC50 of 1000-2000 ng/mL. In addition to the decrease in ECjo, 
the palmitoylated, myristoylated, and lauroylated proteins induced approximately 2- 
fold more alkaline phosphatase activity than the unmodified protein, while the 
decanoylated and octanoylated proteins induced approximately 1, 5-fold more. 

15 In addition to the increase in potency of the myristoylated form of human Sonic 

hedgehog observed in the C3H10T1/2 assay, this form is significantly more potent than 
the uTunodified protein at inducing ventral forebrain neurons in explants of embryonic 
stage Ell rat brain telencephalon. Incubation of Ell telencephalic explants with 
various concentrations of uiunodified, or myristoylated Sonic hedgehog, and 

20 subsequent staining of the explants for the products of the dlx and islehl/2 genes 
(markers of ventral forebrain neurons), indicates that while induction by the immodified 
protein is observed first at 48 nM, induction by the myristyolated form is observed first 
at 3 nM. Moreover, while the unmodified protein induces restricted expression at 3070 
nM, the myristoylated protein induces widespread expression at only 48 nM. A similar 

25 increase in potency was observed when explants of embryonic stage E9 presumptive 
telencephalon were incubated widi either the unmodified, or myristoylated proteins. 
Staining of the explants for the product of the Nkx2.I gene (an early marker of ventral 
forebrain neurons), mdicated that the unmodified protein induced Nkx2.1 first at 384 
nM, while for the myristoylated protein expression of Nkx2.1 was observed first at 12 

30 nM. Moreover, at 48 nM myristoylated Sonic hedgehog, expression of Nkx2.1 was 
widespread while it was undetectable at this concentration using the uiunodified form. 

Additionally, myristoylated human Sonic hedgehog has been shown to be 
significantly more protective than the unmodified protein in reducing the lesion volume 
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which results from administration of malonate into the striatum of the rat brain (See 
Example 16). 

Example 9: Chemical Derivatives of the N-terminal Cysteine of Human Sonic 
5 Hedgehog 

A. General Methods 

Alleviation of Proteins. Samples containmg about 20 \ig of the protein in 50 fxL of 6 M 
guanidine hydrochloride, 50 mM NajHPO^ pH 7.0, were treated with 0.5 \iL of 4- 
vinylpyridine for 2 .h at room temperature. The S-pyridylethylated protein was 
10 precipitated by addition of 40 volumes of cooled eihanol. The solution was stored at - 
20 "^C for 1 h and then centrifuged at 14,000 x g for 8 min at 4°C. The supematants 
were discarded and the precipitate was washed with cooled ethanol. The protein was 
stored at -20''C. 

15 Peptide Mapping. Alkylated protein (0.4 mg/mL in 1 M guanidine hydrochloride, 20 
mM Na2HP04 pH 6.0) was digested with endo Lys-C (Wako Pure Chemical Industries, 
Ltd.) at a 1 : 20 ratio. The digestion was conducted at room temperature for 30 h. The 
reaction was stopped by acidification with 5 ixL of 25% trifluoroacetic acid. The digest 
was analyzed on a Waters 2690 Separation Module with a Model 996 photodiode airay 

20 detector. Prior to injection, solid guanidine hydrochloride was added into the digest to 
a concentration of 6 M to dissolve insoluble material. A reverse phase Vydac C,8 (2.1 
mm internal diameter x 250 mm) column was used for separation, with a 90 min 
gradient of 0.1% trifluoroacetic acid/acetonitrile and 0.1% trifluoroacetic 
acid/acetonitrile at a flow rate of 0.2 mL/min. Individual peaks were collected 

25 manually for mass analysis. 

Mass Dete rmination. The molecular masses of intact proteins were determined by 
electrospray ionization mass spectroscopy (ESI-MS) on a Micromass Quattro II triple 
quadrupole mass spectrometer. Samples were desalted using an on-line Michrom 
30 Ultrafast Microprotein Analyzer system with a Reliasil C^ (1 mm internal diameter x 50 
mm) column. The flow rate was 20 [iL/min. All electrospray mass spectral data were 
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processed using the Micromass MassLynx data system. The molecular masses of 
peptides were determined by matrix assisted laser desorption ionization time-of-fight 
mass spectrometry (MALDI-TOF-MS) on a Voyager-DE™ STR (PerSeptive 
Biosystems, Framingham, MA), Sequencing of the modifed peptide was performed by 
5 Post-source decay (PSD) measurement on the same instrument. a-Cyano-4. 
hydroxycinnamic acid was used as the matrix. 

N-terminal Sequencing. Proteins were sequenced by Edman degradation on a Perkin- 
Ehner Applied Biosystems model 477A Pulsed-Liquid Protein Sequencer. PTH- 
10 thiaproline was made on line by directly loading thiaproline (thia2olidine.4<arboxyIic 
acid) into the sample loading cartridge of the sequencer. 

Bacterial express ion and purification of wild type soluble human Sonic hedgehog N- 
terminal fragment use d for chemical modification. Bacterial pellets from cells 

15 expressing Shh at 4-5% of the total protein were thawed, resuspended in lysis buffer 
(25 mM NajHPO^pH 8, 150 mM NaCl, 1 mM EDTA, 1 mM PMSF, 0.5 mM DTT) at a 
ratio of 1 : 4 (w/v) and lysed by two passes through a Gaulin press (mfg. by APV 
Rannie, Copenhagen, Denmark) at 12,000 p.s,i.. All subsequent purification steps were 
performed at 2-8^*0 unless indicated otherwise. The homogenate was centriftiged at 

20 19,000 X g for 60 min and MES 0.5 M pH 5, was added to the resulting lysate at a ratio 
of 1 : 10 (v/v). The lysate (at pH 5.5) was loaded onto an SP Sepharose Fast Flow 
(Pharmacia, Piscataway, NJ) column (4 g £. coli wet weight/mL resin) equilibrated 
with 25 mM Na2HP04pH 5.5, 150 mM NaCl. The column was washed with 4 column 
volumes (CV) of equilibration buffer, then with 3 C V of 25 mM Na2HP04 pH 5.5, 200 

25 mM NaCl, 0.5 mM DTT, and Histag-Shh was eluted with 800 mM NaCl in the same 
buffer. Elutioh fractions were analyzed for absorbance at 280 nm and by SDS-PAGE. 
Imidazole (IM stock solution at pH 7) and NaCl (5 M stock solution) were added to a 
pool of the peak Shh containing fractions from the SP Sepharose eluate to give final 
concentrations of 20 mM and 1 M respectively, and this material was loaded onto a 

30 NTA-Ni agarose (Qiagen, Santa Clara, CA) column (20 mg/mL resin) equilibrated with 
25 mM Na^HPO.pH 8, 1 M NaCl, 20 mM imidazole. 0.5 mM DTT. The column was 
washed with 5 CV of the same buffer and Histag-Shh eluted with 3 CV 25 mM 
NajHPO, pH 8, 1 M NaCl, 200 mM imidazole, 0.5 mM DTT. The protein content in 



i 



wo 99/28343 



PCT/US98/25676 



the eluate pool from the NTA-Ni column was determined by absorbance at 280 nm. 
The pool was warmed to room temperature and an equal volume of 2.5 M sodium 
sulfate was added. The Phenyl Sepharose step was performed at room temperature. 
The material was loaded onto a Phenyl Sepharose Fast Flow (Pharmacia, Piscataway, 
5 NJ) column (20 mg/mL resin) equilibrated in 25 mM Na2HP04 pH 8, 400 mM NaCl, 
1 .25 M sodium sulfate, 0.5 mM DTT. Histag-Shh was eluted with 25 mM Ma^HPO^ pH 
8, 400 mM NaCl, 0.5 mM DTT.. Typically, we recovered 2-3 g of His-tagged Shh 
from 0.5 kg of bacterial paste (wet weight). Hie product was filtered through 0.2 \im 
filter, aliquoted, and stored at -70°C. The His-tagged Shh was about 95% pure as 
10 determined by SDS-PAGE. As a further assessment of the characteristics of the 
purified product, samples were subjected to evaluation by electrospray ionization mass 
spectrometry (ESI-MS). Approximately 50% of the protein was missing the N-terminal 
methionine. 

To cleave off the hexahistidine tag, enterokinase (Biozyme, San Diego, CA) 

15 was incubated with the Histag-Shh at an enzyme : Shh ratio of 1 : 1000 (w/w) for 2 h 
at 28°C. Uncleaved Histag-Shh and firee Histag were removed by passing the digest 
through a second NTA-Ni agarose column (20 mg Shh/mL resin). Prior to loading, 
imidazole (1 M stock solution at pH 7) and NaCl (5M stock solution) were added to the 
digest to give final concentrations of 20 mM and 600 mM, respectively. This material 

20 was loaded onto a NTA-Ni column equilibrated in 25 mM Na2HP04 pH 8, 600 mM 
NaCl, 20 mM imidazole, 0.5 mM DTT and the flow through collected. The column 
was washed with 1 CV of the same buffer and pooled with the flow through. MES (0.5 
M stock solution at pH 5) was added to the NTA-Ni agarose unbound fraction to a final 
concentration of 50 mM and two volumes of water were added. This material was 

25 loaded onto a second SP Sepharose Fast Flow column (20 mg/mL resin) equilibrated 
with 5 mM Na2HP04 pH 5.5, 150 mM NaCl, 0.5 mM DTT. The column was washed 
with 3 CV of equilibration buffer and 1 CV of the same buffer containing 300 mM 
NaCl. Shh was eluted with 5 mM NajHPO^ pH 5.5, 800 mM NaCl, 0.5 mM DTT, 
Atomic absorption data revealed that Shh at this stage contained 0.5 mol equivalent of 

30 Zn^'. An equimolar concentration of ZnClj was added to the Shh eluant and the protein 
dialyzed against 5 mM Na2HP04 pH 5.5, 150 mM NaCl. 0.5 mM DTT. The resulting 
Shh was > 98 % pure as characterized by SDS-PAGE, size exclusion chromatography 
(SEC), and ESI-MS and, by atomic absorption, contained between 0.9 and 1.1 Zn 
'VShh. 
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ESI-MS data for Histag Shh and products resulting after removal of the histag 
are summarized in Table 7. 

TABLE 7. Characterization of Shh by ESI-MS. 



Protein 

10 

Histag-Shh (-Met) 
(Intact) 

15 Enterokinase-cleaved Shh 



Mass (Da) 



Calculated 

21433.82 
21565.01 

19560.02 



Measured 

21434 
21565 

19560 



20 



25 



30 



B. Specific Chemical Modifications 

Modification of human Sonic hedgehog with A^-ethvlmaleimide. Purified Shh in 5 mM 
Na2HP04 pH-. 5.5, 150 mM NaCl, 0.5 mM DTT was treated with 10 mM M 
ethylmaleimide for 1 h on ice and then dialyzed into 5 mM Na2HP04 pH 5.5, 150 mM 
NaCL The MALDI-TOF-MS data showed that the iV-ethyhnaleimide (NEM)-modified 
protein had an increase in mass of 126 Da, which indicates that only one cysteine 
residue in Shh was modified by the reagent. N-terminal sequencing data showed that 
the protein is sequencible and that an unusual peak, probably PTH-NEM-Cys related, 
was detected at the first cycle (data not shown). Mass spectrometric analysis of the 
pyridylethylated-NEM-Shh under denaturing conditions showed that only two cysteine 
residues in the protein were alkylated, confirmmg that only the thiol group of the N- 
terminal cysteine residue was modified by NEM under native conditions (Table 8). The 
other two cysteine residues, which are apparently buried in the hydrophobic core of the 
protein, cannot be modified without prior denaturation. 
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TABLE 8, Characterization of NEM-modified Shh by MS. 



Protein 


Mass (Calculated) 


Mass (Measured) 


Pyridylethylated NEM Shh 




19895 Da 


if containing 2 free Cys residues 


19895 Da 




if containing 3 free Cys residues 


20000 Da 





When tested in the C3H10T1/2 assay (See Example 3) the A^-ethylmaleimide-modified 
5 hedgehog protein was equal in activity to the unmodified protein. This demonstrates 
that a free sulfhydryl at the N-tcrminus of hedgehog is not required for activity and that 
the A^-ethyhnaleimide moiety is hydrophobic enough to confer some activity on 
hedgehog compared to other more hydrophilic modifications, such as conversion of 
Cys-1 to His or Asp, which produce a reduction in activity. 

10 

Modification of human Sonic hedgehog with formaldehyde to form an N-terminal 
thiaproline, and with acetaldehvde and butvraldehvde to form N-terminal thiaproline 
derivatives . For formaldehyde modification, purified Shh at 3 mg/mL in 5 mM 
Na2HP04 pH 5,5, 150 mM NaCl, 0.5 mM DTT was treated with 0.1% formaldehyde. 

1 5 with or without 10% methanol, at room temperature for 1 to 6 h. The protein was either 
dialyzed against 5 mM Na2HP04 pH 5.5, 150 mM NaCl, or was purified on a CM- 
Poros column (Perseptive Biosystems) as described below and then dilayzed against 5 
mM Na2HP04 pH 5.5, 150 mM NaCl. For modification with acetaldehyde or 
butyraldehyde, purified Shh at 3 mg/mL in 5 mM Na2HP04 pH 5.5, 150 mM NaCl, 0.5 

20 mM DTT wa^ treated with 0. 1 % acetaldehyde or butyraldehyde at room temperature for 
I h and then the protein purified on a CM-Poros column. ESI-MS data for the 
formaldehyde, acetaldehyde-, and butyraldehyde-treated forms of the protein indicated 
that their masses were 13 Da, 27 Da, and 54 Da higher, respectively, than the 
unmodified protein (Table 9). 

25 

TABLE 9, Expected and observed masses of human Sonic hedgehog treated with 
formaldehyde, acetaldehyde, and butyraldehyde. 
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Protein Expected mass »rMH^^ Observed mass *f MH^^ 

Unmodified 19560.02 19560 

Formaldehyde-treated 19572.03 19573 

5 Acetaldehyde*treated 19586.06 19587 

Butyraldehyde-treated 19614.1 1 19614 
* Average masses were used in calculating the expected masses 

For the formaldehyde-treated protein, peptide mapping, as described above, 
demonstrated that the site of the modification occurred in the peptide spanning the first 

10 9 N-terminal residues, and that the exact mass increase was 12 Da. The results of 
MALDI-PSD MS studies of this peptide indicated that the modification occurred on 
Cys-1, and could be explained by a modification of the N-terminal a-amine and the 
thiol side chain of Cys-1 to form a thiaproline (See Figure 12). The structure of the 
thiaproline v\^as confirmed by automated N-tenninal Edman sequencing using "on-line" 

15 prepared PTH-thiaproline as a standard. For the acetaldehyde- and butyraldehyde- 
treated proteins, the ESI-MS data were consistent with the modifications occuring by 
means of the same chemistry as for the reaction vwth formaldehyde, although the exact 
site of modification has not been established. When tested in the C3H10T1/2 cell- 
based assay, the formaldehyde-, acetaldehyde-, and butyraldehyde-modified proteins 

20 were approximately 8-foId, 2-fold, and 3-foId, respectively, more potent than 
unmodified Shh. 



Modification of human Sonic hedgehog mXh A^-isopropvliodoacetamide. This example 
shows that modification of human Shh with a hydrophobic derivative of iodoacetamide 

25 can enhance the potency of the protein as compared to the unmodified Shh. Purified 
Shh (1 mg/mL in 5 mM Na2HP04 pH 7.0, 150 mM NaCl, 0.1 mM DTT) was incubated 
with 1 mM //-isopropyliodoacetamide (NIPIA) at 4°C for 18 h. DTT was then added to 
10 mM final concentration and the sample was dialyzed extensively against 5 mM 
Na^HPO^ pH 5.5, 150 mM NaCl, 0.5 mM DTT. The sample was purified on SP 

30 Sepharose Fast Flow resin and dialyzed further against 5 mM Na2HP04 pH 5.5, 150 
mM NaCl, 0.5, mM DTT. ESI-MS data indicated complete conversion to a species with 
a mass of 19660, corresponding to the predicted mass value (19659) for the singly 
modified protein. Specific modification of the N-terminal cysteine was confirmed by 
peptide mapping of proteolytic fragments. When tested in the C3H10T1/2 cell-based 
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assay, the NIPIA-modified human Shh was approximately 2-fold more potent than the 
unmodified protein. While the modification of the protein resulted in only a modest 
increase in potency, it is expected that modification of the protein with long chain alkyl 
iodoacetamide derivatives will result in hydrophobically-modified forms of the protein 
5 with much greater increases in potency, possibly akin to the 100-200-fold increase 
observed for the palmitoylated, myristoylated, and lauroylated Shh proteins (See 
Example 8). 

Modification of human Sonic hedgehog with l-bromo-2-butanone to form a six- 
10 membered hydrophobic ring at the N-terminus. A thiomorpholinyl- 
(tetrahydrothiazinyl-) derivative of Shh was prepared by incubating human Shh-N (3 
mg/mL in 5 mM NajHPO^ pH 5.5, 150 mM NaCl, 0.15 mM DTT) with 11 mM 1- 
bromo-2-butanone at room temperature for 60 min, followed by reduction with 5 mM 
NaCNBH3 at room temperature for 60 min. The reaction product was purified on a 
15 CM-Poros column (Perseptive Biosystems) as described below and was dialyzed 
against 5 mM Na2HP04 pH 5.5, 150 mM NaCl. 0.5 mM DTT. ESI-MS and proteolytic 
peptide mapping data indicated that the product was a mixture of the expected 
thiomorpholinyl derivative (calculated mass = 19615, observed mass = 19615) and two 
fornis of the protein both with 16 additional mass units. One of these forms is 
20 presumably the uncyclized keto-thioether intermediate. The mixture was tested in the 
C3H10T1/2 assay which indicated that it was approximately 5-fold more potent than 
the immodified protein. 

Example 10. Genetically Engineered Mutations of Human Sonic Hedgehog 

25 

A* Genetically Engineered Mutations of the N-terminal Cysteine 

In this' example, we show that specific replacement of the N-t^rminal cysteine of 
human Sonic hedgehog (Cys-1) by single and multiple hydrophobic amino acid 
residues results in increased potency as compared to the wild type protein in the 
30 C3H10T1/2 cell-based assay described in Example 3. 
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Construction of Shh Cvs-I mutants. The 584 bp Ncol-Xhol restriction fragment 
carrying the His-tagged wild type Shh N-terminal fragment from p6H-SHH was 
subcloned into the pUC-derived cloning vector pNN05 to construct the plasmid 
pEAG649. Cys-1 mutants of soluble human Shh were made by xmique site elimination 
5 mutagenesis of the pEAG649 plasmid template using a Pharmacia kit following the 
manufacturer's recommended protocol. In designing the mutagenic primers, if a desired 
mutation did not produce a restriction site change, a silent mutation producing a 
restriction site change was introduced into an adjacent codon to facilitate identification 
of mutant clones following mutagenesis. To avoid aberrant codon usage, substituted 

10 codons were selected from those occurring at least once elsewhere in the human Shh 
cDNA sequence. The following mutagenic primers were used: (1) for CIF: 5* GGC 
GAT GAC GAT GAC AAA TTC GGA CCG GGC AGG GGG TTC 3^ (SEQ ID NO: 

which introduces an Apol site to make pEAG837; (2) for CI I: 5* GGC GAT GAC 

GAT GAC AAA ATA GGA CCG GGC AGG GGG TTC 3* (SEQ ID NO: ), 

15 which loses an RsrII site to make pEAG838; and (3) for CIM: 5* GGC GAT GAC GAT 

GAC AAA ATG GGC CCG GGC AGG GGG TTC GGG 3* (SEQ ID NO: ), 

which loses both RsrII and Avail sites to make pEAG839. Mutations were confirmed 
by DNA sequencing through a 180 bp Ncol-Bglll restriction fragment carrying the 
mutant SHH proteins* N-termini in plasmids pEAG837-839. Expression vectors were 

20 constructed by subcloning each mutant plasmid's 180 bp Ncol-Bglll fragment and the 
404 bp BgUI-XhoI fragment from pEAG649 into the phosphatase-treated 5.64 kb Xhol- 
Ncol pETlld vector backbone of p6H-SHH. Presence of the introduced restriction site 
change was reconfirmed in the expression vector for each Cys-1 mutant (CIF in 
pEAG840, ClI in pEAG841, and CIM in pEAG842). Expression vectors were 

25 transformed into competent £. coli BL21(DE3)pLysS (Stratagene) following the 
manufacturer's reconunended protocol and selected on LB agar plates containing 100 
|ig/mL ampicillin and 30 ^ig/mL chloramphenicol. Individual colonies were selected 
and transformed bacteria were grown to an A^ of 0.4-0.6 and induced for 3 h with 0.5 

mM IPTG. Bacterial pellets were analyzed for expression of the mutant proteins by 
30 reducing SDS-PAGE and by Western blotting. 

A soluble human Shh mutant with multiple N-terminal hydrophobic 
substitutions ;(Cin) was made by unique site elimination mutagenesis using a 
Pharmacia kit following the manufacturer's recommended protocol. In designing the 
mutagenic primers, if a desired mutation did not produce a restriction site change, a 
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silent mutation producing a restriction site change was introduced into an adjacent 
codon to facilitate identification of mutant clones following mutagenesis. To avoid 
aberrant codon usage, substituted codons were selected from those occurring at least 
once elsewhere in the human Shh cDNA sequence. The following mutagenic primer 

5 was used on the CIF template plasmid pEAG837 for Clll: 5' GCG GCG ATG ACG 
ATG ACA AAA TCA TCG GAC CGG GCA GGG GOT TCG GG 3' (SEQ ID NO: 

), which removes an AppI site to make pEAG872. Mutations were confirmed by 

DNA sequencing through a 0.59 kb Ncol-Xhol restriction fragment carrying the mutant 
cm Shh. An expression vector was constructed by.subcloning the 'mutant plasmid' s 

10 Ncol-Xhol fragment into the phosphatase-treated 5.64 kb Xhol-Ncol pETlld vector 
backbone of p6H-SHH, Presence of the introduced restriction site change was 
reconfirmed in the expression vector for the CI II mutant, pEAG875. The expression 
vector was transformed into competent £ coli BL21(DE3)pLysS (Stratagene) 
follov«ng the manufacturer's recommended protocol and selected on LB agar plates 

15 containing 100 ^g/mL ampicillin and 30 ^g/mL chloramphenicol. Individual colonies 
were selected and transformed bacteria were grown to an A^^ of 0.4-0.6 and induced 

for 3 h with 0.5 mM IPTG. Bacterial pellets were analyzed as described above to 
confirm expression of mutant Shh protein. 



20 Purification of Cvs-1 mutants of human Sonic hedgehog. The His-tagged mutant 
hedgehog proteins were purified from the bacterial pellets as described for the wild type 
protein above except for two modifications. (1) The Phenyl Sepharose step was 
eliminated and instead the protein pool from the first NTA-Ni agarose column was 
dialyzed into 25 mM NajHPO^ pH 8, 400 mM NaCl, 0.5 mM DTT in preparation for 

25 the enterokinase cleavage step. (2) The final ion exchange step was changed from step 
elution on SP-Sepharose Fast Flow to gradient elution from a CM-Poros column 
(Perseptive Biosystems). This was carried out in 50 mM Na2HP04 pH 6.0 with a 0-800 
mM NaCl gradient over 30 column volumes. The pooled peak fractions from this step 
were dialyzed into 5 mM Na2HP04 pH 5.5, 150 mM NaCl and were stored at -80<> C. 

30 Mass spectrometry of the purified proteins gave the predicted mass ions for each 
purified form. 
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Activity of the Cvs-1 mutants of human Sonic hedgehog . As shown in Table 10, 
mutation of the N-terminal cysteine has a significant effect on the potency of the 
resulting hedgehog protein in the C3H10T1/2 assay. For single changes, potency 
generally correlates with the hydrophobicity of the substituted ammo acid, that is 
5 phenylalanine, and isoleucine give the greatest activation, methionine is less activating, 
while histidine and aspartic acid diminish activity compared to the wild type cysteine. 
Replacing the cysteine with two isoleucines gives an additional increase in activity over 
the single isoleucine substitution. Given that nine amino acids are categorized as more 
hydrophobic than cysteine (Proteins: structures and molecular properties, 2"** ed, 1993, 

10 T. E. Creighton, W. H. Freeman Co. page 154), the substitutions tested above are 
clearly not an exhaustive survey of the possible mutations at the N-terminus that can 
give rise to more active forms of hedgehog. However, the results demonstrate that 
activation is not restricted to a single amino acid structure and that substitution of more 
than one amino acid can give a further increase in potency. Therefore, one skilled in 

15 the art could create forms of hedgehog with other amino acid substitutions at the N- 
terminus that would be expected to have greater potency than the wild type protein, 

TABLE 10. Relative potency of amino acid modifications of human Sonic hedgehog in 
the C3H10T1/2 assay. 

20 



N-TERMINUS 


RELATIVE POTENCY 


C (wild type) 


IX 


M 


2X 


F 


4X 


I 


4X 


II 


lOX 



r 
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B. Genetically Engineered Mutations of Internal Residues 

Construction of the C1II/A169C mutant . The soluble human Shh mutant C1II/A169C 
5 (with cysteine substituted for the dispensable C-terminal residue A 169 which is 
predicted to have a high fractional solvent accessibility) was made by unique site 
elimination mutagenesis using a Pharmacia kit following the manufacturer's 
recommended protocol and employing the mutagenic oligo design principles described 
above. The following mutagenic primer 5' GAG TCA TCA GCC TCC CGA TTT 

1 0 TGC GCA CXC CGA GTT CTC TGC TTT CAC C 3' (SEQ IDNO: ) was used on 

cm Shh template pEAG872 to add an Fspl site to make pSYS049. The C1II/A169C 
mutations were confirmed by DNA sequencing through a 0.59 kb Ncol-Xhol restriction 
fragment. The expression vector pSYSOSO was constructed by subcloning the Ncol- 
Xhol fragment into the phosphatase-treated 5.64 kb Xhol-Ncol pETlId vector 

15 backbone of p6H-SHH. Presence of the introduced restriction site change was 
reconfirmed in the expression vector. The expression vector was transformed into 
competent £. coli BL21(DE3)pLysS, colonies were selected, induced, and screened for 
Shh expression as described above. 

20 Purification of the C1II/A169C mutant. The C111/A169C mutant was purified as 
described in Example 9 for wild type Shh except with the following modifications. (1) 
EDTA was left out of the lysis buffer, (2) the order of the NTA-Ni and SP Sepharose 
steps were switched and the Phenyl Sepharose step was omitted, (3) after clarification 
of the lysed bacteria by centrifiigation, additional NaCl was added to the supernatant to 

25 a final concentration of 300 mM, (4) the elution buffer from the NTA-Ni column 
contained 25 mM Na2HP04 pH 8.0, 200 mM imidazole, 400 mM NaCl, (5) the elution 
pool from the NTA-Ni column was diluted with 3 volumes of 100 mM MES pH 5.0 
prior to loading onto the SP Sepharose column, (6) prior to addition of enterokinase, the 
SP Sepharose elution pool was diluted with half a volume of 50 mM Na2HP04 pH 8.0, 

30 and (7) the DTT in the elution buffer from the final SP Sepharose colxmm contained 0.2 
mM DTT and the elution pool from this step was aliquoted and stored at -70**C. 
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Hvdrophobic modification and activity of the C1II/A169C mutant. For modification 
with A^-(l-pyrene) maleimide (Sigma), purified C1II/A169C at 4.6 mg/mL in 5 
mM NajHPO^pH 5.5, 800 mM NaCl, 0.2 mM DTT was diluted with an equal volume 
of 50 mM MES pH 6,5 and to this a twentieth of a volume of pyrene maleimide from a 
5 2.5 mg/mL stock in DMSO was added. The sample was incubated for 1 h at room 
temperature in the dark. At this time additional DTT was added to 0.5 mM and the 
sample incubated further for an additional hour at room temperature. The modified 
protein was tested directly for activity in the C3H10T1/2 assay as described in Example 
3. Prior to modification, the specific activity of the protein was ECjo = 0.22 ng/mL, 

10 while after treatment with pyrene maleimide the specific activity was increased to EC50 
= 0.08 ng/mL. Increases in the specific activity of the modified product by up to 3-fold 
were observed frequently indicating that the addition of the hydrophobic group near the 
C-terminus of Shh resulted in a further increase in activity as compared to the CI II 
starting material. When compared to the vnld type unmodified Sonic hedgehog protein, 

15 the J^-(l -pyrene) maleimide-modified ClII protein was approximately 30-fold more 
potent. While pyrene maleimide provided a simple test system for evaluating 
modification at this site, other hydrophic maleimides or other cysteine targeted 
chemistries can also be used. 

20 Example 11: Comparison of the Potency of Various Hydrophobically-Modified 
Forms of Human Sonic Hedgehog in the C3H10T1/2 Assay 

The activity of various hydrophobically-modified forms of human Sonic 
hedgehog (prepared using the chemistries and genetic engineeering methods described 
in Section V) was tested in the C3H10T1/2 assay as described in Example 3. 

25 The derivatives were assayed over a concentration range as described in 

Example 3. The concentration of hedgehog derivative that resulted in 50% of the 
maximum response in the assay was compared to the wild type concentration. The 
relative activities are shown in Table 1 1, below, and in Figure 13. 

30 TABLE 11. Relative Potency of Hedgehog Derivatives in the C3H10T1/2 assay. 
Modification EC<;. (x^fold more potent than wild type Shh^ 
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C:16palmitoyl 100 

C:14myristoyl 100 

C:121auroyl 100 

C:10decanoyl 33 

5 Isoleucyl-isoleucyl with Al 69C pyrenyl 30 

C:8octanoyl 10 

Isoleucyl-isoleucyl 10 

C:0thiaprolyl 8 

Thiomoipholinyl 5 

10 Phenylalanyl 4 

Isoleucyl 4 

N-isopropylacetamidyl 2 

Methionyl 2 

N-ethylmaleimidyl 1 
1 5 Cysteinyl (wild type) 1 

Aspartyl <1 

Histidyl ' <1 



The C3H10T1/2 assay demonstrates that a wide variety of hydrophobic 
20 modifications to hedgehog increase the protein's activity when compared to the wild 
type, unmodified protein. Hydrophilic modifications (aspartic acid and histidine) do 
not have this effect. 

Example 12: Evaluating the Efficacy of Hydrophobically-Modified Human Sonic 
25 Hedgehog in a Rat Malonate-Induced Striatal Lesion Assay 

Injection of malonate, an inhibitor of the mitochondrial enzyme succinate 
dehydrogenase, into the rat striatum (the rodent equivalent of the primate caudate and 
putamen) causes degeneration of striatal medium spiny neurons. In humans, 
degeneration of medium spiny neurons in the caudate and putamen is the primary 
30 pathological feature of Huntington's disease. Thus, the malonate-induced striatal lesion 
in rats can be used as a model to test whether hydrophobically-modified hedgehog 
protems can prevent the death of the neurons which degenerate in Huntington's disease. 
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Sprague-Dawley rats were injected with various concentrations of 
hydrophobically-modified human Sonic hedgehog in the striatum using stereotaxic 
techniques. Stereotaxic injections (2 iiL) were performed imder sodium pentobarbital 
anesthesia (40 mg/kg) and placed at the following coordinates: 0.7 mm anterior to 
5 bregma, 2.8 mm lateral to the midline, and 5.5 mm ventral to the surface of the skull at 
bregma. At various times (usually 48 h) after injection of the hydrophobically- 
modified protein, rats were anesthetized with isoflurane and given a stereotaxic 
injection of malonate (2 ^mol in 2 iiL) at the same coordinates in the striatum. Four 
days after malonate injection, rats were sacrificed and their brains removed for 
10 histological analysis. Coronal sections were cut through the striatum at a thickness of 
25 nm and stained for cytochrome oxidase activity to distinguish lesioned fi-om 
unlesioned tissue. The volume of the lesion in the striatum is measured using an image 
analysis system. 

The effect of hydrophobically-modified human Sonic hedgehog protein in the 
15 malonate-induced rat striatal lesion model is shown in Figure 14. Unmodified Sonic 
hedgehog (prepared as described in Example 9), myristoylated Shh (prepared as 
described in Example 8), and the ClII mutant of Shh (prepared as described in Example 
10) all reduced lesion volume to a similar extent in this model. However, the 
hydrophobically-modified proteins (myristoylated Shh and ClII Shh) showed an 
20 increase in potency relative to the unmodified Sonic hedgehog. 

Example 13: N-Octybnaleimidederivitizationof sHh-N 
For a 1 mg/ml final concentration 

1) Make a 20 mM solution of octylmaleimide (m.w, - 209) in DMSO (--4.2 
25 mg/ml). 

2) Dilute stock of 10 mg/ml sHh-N (in 5 mM NaP04 pH 5.5, 150 mM NaCl, 
0.5 mM DTT) 10-fold with PBS (Gibco product # 20012-027, pH 7.2) to give a 1 
mg/ml (or SO xM) sHh-N solution, [NOTE: DTT, which competes with sHh-N for 
maleimide in the subsequent reaction, is also 50 ^iM in this solution.] 

30 3) Immediately add 1/200 vol. of octylmaleimide to the 1 mg/ml sHh-N (i.e. 5 

^l/lml). This gives a 2:1 molar ratio (100 jiM:50 \iM) of octyknaleimide to sHh-N. 
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4) Mix this solution by gentle inversion of the tube and incubate for 1 hour at 
room temperature. 

5) Finally, 1/1000 vol. of 0,35 M DTT was added to each tube to scavenge any 
remaining octylmaleimide and to serve as a reductant. 

5 6) For a vehicle control, combine a solution of vehicle (5mM NaP04 pH 5.5, 

1 50 mM NaCl, 0.5 mM DTT) with PBS (Gibco product # 20012-027, pH 7.2) in a 1 : 10 
ratio. Add 1/400 vol. of 20niM octylmaleimide in DMSO and a 1/400 vol. of DMSO to 
give a jRnal concentration of 50 ^tM N-octylmaleimide and 0.5% DMSO. Finally, add 
1:1000 vol. of0.35MDTT. 

10 

Approximate composition of the 1 mg/ml N-octvlmaleimide sHh-N solution 
PBS (-pH 7.2) 

50 ^iM sUh-N conjugated to N-octyhnaleimide 
50 pM DTT conjugated to N-octylmaleimide 
15 350 pM DTT 
0.5% DMSO 

Approximate composition of the N-octvlmaleimide vehicle solution 
PBSC'-pH 7.2) 
20 50 pM DTT conjugated to N-octylmaleimide 
350 pMDTT 
0.5% DMSO 

For a 3 mg/ml final concentration 

25 1) MfiSke a 60 mM solution of octylmaleimide (m.w. = 209) in DMSO (-12.6 

mg/ml). 
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2) Dilute stock of 10 mg/ml sHh-N (in 5 mM NaP04 pH 5.5, 150 mM NaCl, 
0.5 mM DTT) 10-fold with PBS (Gibco product # 20012-027, pH 7.2) to give a 3 
mg/ml (or 150 uM) sHh-N solution. [NOTE: DTT, which competes with sHh-N for 
maleimide in the subsequent reaction, is also 150 in this solution.] 

5 3) Immediately add 1/200 vol. of octylmaieimide to the 3 rag/ml sHh-N (i.e. 5 

Hl/lml). This gives a 2:1 molar ratio (300 jiM:150 \iM) of octylmaieimide to sHh-N. 

4) Mix this solution by gentle inversion of the tube and incubate for I hour at 
room temperature. 

5) Finally, add 1/1000 vol. of 0.35 M DTT to each tube to scavenge any 
1 0 remaining octyhnaleimide and to serve as a reductant. 

6) For a vehicle control, combine a solution of vehicle (5mM NaP04 pH 5.5, 
150 mM NaCl, 0.5 mM DTT) with PBS (Gibco product # 20012-027, pH 7.2) in a 3:7 
ratio. Add 1/400 vol. of 60mM octyhnaleimide in DMSO and a 1/400 vol. of DMSO to 
give a final concentration of 150 ^iM N-octylmaleimide and 0.5% DMSO. Fmally, add 

15 1:1000 vol. of0.5M DTT. 

Approximate composition of the 3mg/ml N-octvlmaleimide sHh-N solution 
PBS (^pH 7.2) 

1 50 [iM sHh-N conjugated to N-octylmaleimide 
20 1 50 \iM DTT conjugated to N-octylmaleimide 
500 [iM DTT 
0.5% DMSO 

Approximate composition of the N-octvlmaleimide vehicle solution 
25 PBS (-pH 7.2); 

150 ^M DTT conjugated to N-octylmaleimide 
500 fiMDTT 



wo 99/28343 



PCT/US98«5676 



t -87- 
0.5%DMSO 

0.5% DMSO 
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Equivalents 

While we have described a number of embodiments of this invention, it is apparent 
to persons having ordinary skill in the art that our basic embodiments may be altered to 
25 provide other embodiments that utihze the compositions and processes of this 
invention. Therefore, it will be appreciated that the scope of this invention includes all 
alternative embodiments and variations which are defined in the foregoing specification 
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and by the claims appended hereto; and the invention is not to be limited by the specific 
embodiments presented in the examples. 
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WHAT IS CLAIMED IS: 

1. An isolated, protein comprising an N-terminal amino acid and a C-terminal amino 
acid, wherein the protein is selected from the group consisting of: 

(a) a protein with an N-terminal cysteine that is appended with at least one 
5 hydrophobic moiety; 

(b) a protein with an N-terminal amino acid that is not a cysteine appended with 
at least one hydrophobic moiety; and 

(c) a protein with at least one hydrophobic moiety substituted for the N- 
terminal amino acid. 

10 

2. The protein of claim 1, wherein the hydrophobic moiety is a peptide comprising at 
least one hydrophobic amino acid. 

15 

3 . The protein of claim 1 , wherein the hydrophobic moiety is a lipid. 



4. The protein of claim 1 , wherein the protein further comprises a hydrophobic moiety 
20 substituted for, or appended to, the C-terminal ammo acid. 



5. The protein of claim 1 , wherein the protein is an extracellular signaling protein. 



6. The protein of claim 1 , wherein the N-termmal amino acid is a functional derivative 
25 of a cysteine. 



wo 99/28343 



PCT/US9a/25676 



; .93- 

7. The protein of claim 1, wherein the protein is modified at both the N-terminal 
amino acid and the C-terminai amino acid. 

8. The protein of claims 4 or 7, wherein the protein has a hydrophobic moiety 
5 substituted for, or appended to, at least one amino acid intermediate to the N- 

terminal and C-terminal amino acids. 

9. The protein of claim 1, wherein the protein has a hydrophobic moiety substituted 
for, or appended to, at least one amino acid intermediate to the N-terminal and C- 

10 terminal amino acids. 

10. The protein of claim 3, wherein the lipid moiety is a fatty acid selected from 
saturated and unsaturated fatty acids having between 2 and 24 carbon atoms. 

15 11. The protein of claun 1, wherein the protein is a hedgehog protein obtainable from 
a vertebrate source. 

1 2. The protein of claim 1 1 , wherein the hedgehog is obtainable from a human or rat. 

20 13. The protein of claim 1 1, wherem the vertebrate hedgehog is selected from the 
group consisting of Sonic, Indian, and Desert hedgehog. 

14. The protein of claim 1, fiirther comprising a vesicle in contact with the 
hydrophobic moiety. 

25 



15. The protein of claim 14, wherein the vesicle is selected from the group consisting of 
a cell membrane, a micelle, and a liposome. 
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16. The protein of claim 1 1, wherein the hedghog protein has an amino acid sequence 
according to any one of SEQ ID NOS: 1-4. 

17. The protein of claim 13, wherein the hedgehog protein is missing between 1 and 
5 about 10 amino acids from the C-terminus thereof, when compared to a wild-type 

hedgehog protein. 

18. The protein of claim 16, wherein the protein has at least 60% amino acid identity to 
Sonic, Indian or Desert hedgehog. 

10 

19. An isolated, protein of the form: A-Cys-[Sp3-B- [Sp]- X, wherein 
A is a hydrophobic moiety; 

Cys is a cysteine or functional equivalent thereof; 

[Sp] is an optional spacer peptide sequence; 

15 B is a protein comprising a plurality of amino acids and, optionally, another spacer 
peptide sequence; and 

X is optionally another hydrophobic moiety linked to an amino acid of protein B. 

20. The isolated protein of claim 19, wherein the isolated protein is a hedgehog 
20 protem. 

21 . The isolated protein of claim 20, wherein, if X is present, then it is cholesterol. 

22. The isolated protein of claim 19, wherein protein B is modified at at least one 
25 other amino acid with at least one hydrophobic moiety. 

23. The isolated protein of claim 19, wherein the A-Cys linkage is via an amino 
group of cysteine. 
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24. The isolated protein of claim 19, fiirther comprising a vesicle in contact 
therewith. 

5 25. The isolated protein of claim 24 , wherein the vesicle in contact therewith is 
selected from the group consisting of a cell membrane, micelle and liposome. 

26. A vesicle to which is attached a plurality of molecules, at least two of the 
plurality having the form of claim 19. 

10 

27. The vesicle of claim 26, wherein the vesicle is selected from the group 
consisting of a cell membrane, liposome and micelle. 

28. An isolated, protein having a C-terminal amino acid and an N-terminal 
15 thioproline group, said group formed by reacting an aldehyde with an N-terminal 

cysteine of the protein. 

29. An isolated, protein having a C-terminal amino acid and an N-tenninal amide 
group, said group formed by reacting a fatty acid thioester with an N-terminal 

20 cysteine of the protein. ^ 

30. An isolated, protein having a C-terminal amino acid and an N-terminal 
maleimide group, said N-terminal maleimide group formed reacting a 
maleimide group with the N-terminal cysteine of the protein. 

25 

31. The isolated protein of claims 28, 29 or 30, wherein the C-tcrminal amino acid 
of the protein is modified with an hydrophobic moiety. 



•J 



I 



wo 99/28343 PCT/US98/25676 



-96- 

32. The isolated protein of claim 3 1 , wherein the protein is a hedgehog protein. 



33. The isolated protein of claim 32, wherein the C-terminal hydrophobic moiety is 
cholesterol. 

5 

34. A method of generating a multivalent protein complex comprising the step of 
linking, in the presence of a vesicle, a hydrophobic moiety to an N-terminal 
cysteine of a protein, or a functional equivalent of the N-terminal cysteine. 



10 35. The method of claim 34, wherein the step of linking comprises linking a lipid 
moiety which is selected from saturated and unsaturated fatty acids having between 
2 and 24 carbon atoms. 

36. The method of claim 34, wherem the protein is a hedgehog protein. 

15 

37. The method of claim 36, wherein the hedgehog is selected from the group 
consisting of Sonic, Indian and Desert hedgehog. 

38. The method of claim 36, wherein the hedghog has an amino acid sequence 
20 according to any one of SEQ ID NOS: 1 -4. 

39. The method of claim 34, wherein the step of linking comprises linking with a 
vesicle selected from the group consisting of a cell membrane, liposome and 
micelle. 

25 

40. A method for modifying a physico-chemical property of a protein, comprising 
introducing at least one hydrophobic moiety to an N-terminal cysteine of the protein 
or to a functional equivalent of the N-terminal cysteine- 
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41 . The method of claim 40, further comprising contacting the hydrophobic moiety 
with a vesicle. 

5 42. The method of claim 40, wherein the hydrophobic moiety is either a lipid 
moiety selected from saturated and an unsaturated fatty acids having between 2 and 
24 carbon atoms or is a hydrophobic protein. 

43. The method of claim 40, wherein the protem is a hedgehog protein. 

10 

44. The method of claim 43, wherein the hedgehog protein is" selected from the 
group consisting of Sonic, Indian and Desert hedgehog. 

45. The method of claim 43, wherein the hedgehog has an amino acid sequence 
1 5 accordmg to any one of SEQ ID NOS: 1 -4. 

46. The method of claim 41, wherein the step of contacting comprises contacting 
with a vesicle selected from the group consisting of a cell membrane, liposome and 
micelle. 

20 

47. A protein complex, produced by the method of claim 34. 

48. A modified protein, produced by the method of claim 40. 

25 49. The complex of claim 47, wherein the protein is selected from the group 
consisting of gelsolin; an interferon, an interieukin, tumor necrosis factor, monocyte 
colony stim\ilating factor, granulocyte colony stimulating factor, granulocyte 
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macrophage colony stimulating factor, erythropoietin, platelet derived growth 
factor, growth hormone and insulin. 

50. A method for modifying a protein having a biological activity and containing an 
5 N-terminal cysteine, comprising reacting the N-terminal cysteine with a fatty acid 

thioester to form an amide, wherein such modification enhances the protein's 
biological activity. 

5 1 . The method of claim 50, wherein the protein is a hedgehog protein. 

10 

52. The method of claim 51, wherein the hedgehog protein is selected from the 
group consisting of Sonic, Indian, Desert hedgehog, and functional variants 
thereof. 

15 53. A method for modifying a protein having a biological activity and containing 
an N-terminal cysteine, comprising reacting the N-terminal cysteine with a 
maleimide group, wherein such modification enhances the protein's biological 
activity. 

20 54. The method of claim 53, wherein the protein is a hedgehog protein. 

55. The method of claim 54, ^dlerein the hedgehog protein is selected from the 
group consisting of Sonic, Indian, Desert hedgehog, and functional variants 
thereof. 

25 

56. A method for modifying a protein having a biological activity comprising 
appending an hydrophobic peptide to the protein. 
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57. The method of claim 56, wherein the hydrophobic peptide is appended to an 
amino acid of the protein selected from the group consisting of the N-terminal 
amino acid, the C-terminal amino acid, an amino acid intermediate between the N- 
terminal amino acid and the C-terminal amino acid, and combinations of the 

5 foregoing. 

58. The method of claim 69, wherein the protein is a hedgehog protein. 

59. The method of claim 71, wherein the hedgehog protein is selected from the 
1 0 group consisting of Sonic, Indian and Desert hedgehog. 

60. A therapeutic use of the protein of any of claims 1 or 20, comprising 
administering the protein to a subject. 

15 61. A method of treating a neurological disorder in a patient comprising 
administering to the patient a protein of any of claims 1 or 20. 

62. The protein of claim 1, wherein the protein is an extracellular signaling protein. 

20 63, The method of claim 57, wherein the step of appending comprises replacing at 
least the N- terminal amino acid of the protein with at least one hydrophobic amino 
acid. 

64. The method of claim 63, wherem the at least one hydrophobic amino acid is a 
25 plurality of isoleucine residues. 

65. The method of claim 63, further comprising chemically modifying at least one 
of the isoleucine residues. 
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66. An isolated, protein having a C-terminal amino acid and an N-tenninal 
acetamide group, said group formed by reacting a substituted acetamide with an N- 
terminal cysteine of the protein. 

5 

67. An isolated, protein having a C-terminal amino acid and an N-terminal 
thiomorpholine group, said group formed by reacting a haloketone group with an N- 
terminal cysteine of the protein. 

10 68. A method for modifying a protein having a biological activity and containing 
an N-terminal cysteine, comprising reacting the N-terminal cysteine with a 
substituted acetamide group, wherein such modification enhances the protein's 
biological activity. 

1 5 69. The method of claim 68, wherein the protein is a hedgehog protein. 

70. The method of claim 69, wherein the hedgehog protein is selected from the 
group consisting of Sonic, Indian, Desert hedgehog, and functional variants 
thereof. 

20 

71. A method for modifying a protein having a biological activity and containing 
an N-terminal cysteine, comprising reacting the N-terminal cysteine with a 
haloketone group, wherein such modification enhances the protein's biological 
activity. 

25 

72. The method of claim 71 , wherein the protein is a hedgehog protein. 

73. The method of claim 72, wherein the hedgehog protein is selected from the 
group consisting of Sonic, Indian, Desert hedgehog, and functional variants thereof 
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74. A hedgehog polypeptide modified with one or more lipophilic moieties with the 
proviso tfiat, in the instance wherein the hedgehog polypeptide is the mature N- 
terminal proteolytic fragment of a hedgehog protein, the lipophilic moiety is other 

5 than a sterol at the C-terminal residue. 

75. A hedgehog polypeptide modified with one or more lipophilic moieties at internal 
amino acid residues. 

10 76, A hedgehog polypeptide modified with one or more lipophilic aromatic 
hydrocarbons. 



15 



77. The hedgehog polypeptide of any of claims 74-76 which polypeptide is provided 
as a purified protein preparation. 

78, The hedgehog polypeptide of any of claims 74-76 which polypeptide is provided 
as a pharmaceutical preparation. 



79. The hedgehog polypeptide of claim 74 or 75, wherein the lipophilic moieties are 
20 selected from the group consisting of fatty acids, lipids, esters, alcohols, cage 

structures, and aromatic hydrocarbons, 

80. The hedgehog polypeptide of claim 76 or 79, wherein the aromatic hydrocarbon is 
selected from the group consisting of benzene, peiylene, phenanthrene, 

25 anthracene, naphthalene, pyrene, chrysene, and naphthacene. 

81. The hedgehog polypeptide of claim 80, wherein the aromatic hydrocarbon is a 
pyrene. 

30 82. The hedgehog polypeptide of claim 74 or 75, wherein the lipophilic moieties are 
selected^ from the group consisting of isoprenoids, terpenes , and polyalicyclic 
hydrocarbons. 

83. The hedgehog polypeptide of claim 82, wherein the lipophilic moieties are 
35 selected from the group consisting of adamantanes, buckmmsterfullerenes, 

vitamins, polyethylene glycol, oligoethylene glycol, (Cl-C18)-alkyl phosphate 
diesters, -0-CH2-CH(OH>0-{C12-C18)-alkyl. 

84. The hedgehog polypeptide of claim 83, wherein the lipophilic moieties arc 
40 selected from the group consisting of 1- or 2-adamantylacetyl, 3-methyladamant- 

1-ylacetyl, 3-methyl-3-bromo-l-adamantylacetyl, 1-decaIinacetyl, camphoracetyl, 
camphaneacetyl, noradamantylacetyl, norbomaneacetyl, bicyclo[2.2.2.]-oct-5- 
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eneacetyl, 1 -methoxybicyclo[2.2.2.)-oct-5-ene-2-carbonyl, cis-5-norbomene- 
endo-2,3-dicarbonyl, 5-norbomen-2-yIacetyI, (IRH - )-myrtentaneacetyl, 2- 
norbomaneacetyl, anti-3-oxo-tricyclo[2.2. 1 .0<2,6> ]-heptane-7-carbonyl, 
decanoyl, dodecanoyl, dodecenoyl, tetradecadienoyl, decynoyl and dodecynoyl. 

5 

85. The hedgehog polypeptide of any of claims 74-76, wherein the lipophilic moiety 
or moieties potentiate the biological activity of the polypeptide relative to the 
modified hedgehog polypeptide. 



10 86. A method for altering the growth state of a ceil responsive to hedgehog signaling, 
comprising contacting the cell with a lipophilic-modified hedgehog polypeptide 
of any of claims 74-76. 



wo 99/28343 PCT/US98/25676 



a b 



200k 

98k: 

68K 
44K 



30K - 
15K — 



e f g h i j k 




A. 



1 



1 / 15 



WO 99/28343 



PCTAJS98/25676 




2 / 15 



wo 99/28343 



PCTAJS98/25676 




Time 



4 / 15 



wo 99/28343 



PCT/US98/25676 



54VP 

s 



PA 



^ hi hi h$ 



p Or -P- -G- -R- -G- -F- -G- -K 



600 

Mass(mfi} 



•<4VP+SH) 




1000 



1200 




5 / 15 



wo 99/28343 PCT/US98«5676 




6 / 15 



wo 99/28543 



PCT/US98/25676 




7 / 15 



wo 99/28343 



PCTAJS98/25676 



Figure 8: Alignment of N-teiminai firagments of Human Hedgehog Proteins 



Indiaji 

Sonic 

Desert 



Indian 

Sonic 

Desert 



Indian 

Sonic 

Desert 



1 50 
CGPGRWGSR RRPPRK-LVP LAYKQFSPNV PEKTLGASGR YEGKIARSSE 
CGPGRGFG-K RRHPKK-LTP LAYKQFIPNV AEKTLGASGR YEGKISRNSE 
CGPGRGPVGR RRYARKQLVP LLYKQFVPGV PERTLGASGP AEGRVARGSE 

51 100 
RFKELTPNYN PDIIFKDEEN TGADRIiMTQR CKDRLNSLAI SVMNQWPGVK 
RFKELTPNYN PDIIFKDEEN TGADRLMTQR CKDKLNAIAI SVMNQWPGVK 
RFRDLVPNYN PDIIFKDEEN SGADRIiMTER CKERVNAIAI AVMNMWPGVR 

101 150 
LRVTEGWDED GHHSEESLHY EGRAVDITTS DRDRNKYGLL ARLAVEAGFD 
LRVTEGWDED GHHSEESIiHy EGRAVDITTS DRDRSKYGML ARLAVEAGFD 
LRVTEGWDED GHHAQDSLHY EGRALDITTS DRDRNKYGLL ARLAVEAGFD 



151 176 
Indian WVYYESKAHV HCSVKSEHSA AAKTGG 
Sonic WVYYESKAHI HCSVKAENSV AAKSGG 
Desert WVYYESRNHV HVSVKADNSL AVRAGG 



SSQ ID HO: 1 
SBQ ID NO . 2 
SEQ XD KO. 3 



Gap(s)» indicated by added to facilitate alignment 



8 / 15 



wo 99/28343 



PCT/US98/25676 



Figure 9: Consensus sequence of N-terminal fragments 
SEQ ID NO. 4 



1 

CGPGR^„ 



U X4 XS 



.EKTLGASGR 



40 



PDIIFKDEEN 



X21 



80 

,GADRLMT«,R 



120 

CK«3«m5NSLAI «,VMN«,WPGVK LRVTEGWDED GHH„, „,SLHY 

160 

EGRAVDITTS DRDR^^KYG^jL ARIAVEAGFD WVYYES^, ,a,H„, 

176 

Where: 

XI is either V or G; 
X2 is either V, F or P; 
X3 is either G or V; 
X4 is either S or G; 
X5 is either R or K; 
X6 is either P, H or Y; 
X7 is either P or A; 
X8 is either R or K; 
X9 is any amino acid; 



XIO 


is 


either V or T; 




Xll 


is 


either A or L; 




X12 


is 


either S« I or 


V; 


X13 


is 


either N or G; 




X14 


is 


either P or A; 




X15 


is 


either Y or A; 




X16 


is 


either I or V; 




XX7 


is 


either A or S; 




X18 


is 


either S, N or G; 


X19 


is 


either E or D; 




X20 


is 


either T or V; 




X21 


is 


either T or S; 




X22 


is 


either Q or E; 




X23 


is 


either D or E; 




X24 


is 


either R or K; 




X25 


is 


either L or V; 




X26 


is 


either S or, A; 




X27 


is 


either Q or M; 




X28 


is 


• either S or A; 




X29 


is 


either E or Q; 




X30 


is 


either E or D; 




X31 


is 


either N or S; 




X32 


is 


either L or H; 




X33 


is 


either K or R; 




X34 


is 


either A or N; 




X35 


is 


either V or I; 




X36 


is 


either C or V; 




X37 


is 


either S or A; 




X38 


is 


either E or D; 




X39 


is 


either H or K; 




X40 


is 


either A, V or 


L; 


X41 


is 


either K or R; 


and 


X42 


is 


either T, S or 


A. 
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